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IT HAS BEEN suggested!** that the attitudes 
of students toward school subjects might be illu- 
minated by analyzing the habitual orientation of 
students toward decision or choice situations ac- 
cording to two dimensions, One dimension is 
provided by considering the degree of deliberation 
involved in making achoice: the preference for 
ends suggested by abstract, relational considera- 
tion, as contrasted with the appreciation of imme- 
diate, proximate ends; the other dimension is pro- 
vided by the relative preference for social as op- 
posed to non-social objects. 

The intersection of these two dimensions yields 
four vectors: 1) ‘‘theoretic,’’ the deliberative, 
analytic orientation toward the non-social environ- 
ment, characterized by interest in the natural 
Sciences and in mathematics; 2) “prudent, ’’ the 
deliberative orientation toward the social environ- 
ment, characterized by interest in the behavioral 
sciences; 3) ‘‘aesthetic,’’ the appreciation of the 
quality of the directly given non-social environ- 
ment; and 4) ‘‘immediate,’’ the preference for 
proximate social ends. 2 

The ranges between the poles of these four vec- 
tors provide six continua for the assessment of 
the structure of attitudes: 1) prudent vs. theoretic, 
2) prudent vs. immediate, 3) prudent vs. aes- 
thetic, 4)theoretic vs. immediate, 5) theoretic 
vs. aesthetic, and 6) aesthetic vs. immediate. 
These abstractions are summarized by the design 
in Figure 1. 

If an individual who selects relatively proximate 
ends in a variety of situations may be reliably ex- 
pected to select proximate ends in a future choice 
situation, characterizing him as ‘‘immediate’’ (or 
“aesthetic, ’’ in а non-social context) would be of 
value for the prediction of behavior. More im- 
portant for educators, however, is the possibility 
that analysis of decision behavior with the aid of 
these dimensions may provide a useful theoretical 
perspective for the evaluation of curricular objec- 
tives. An instrument which **measures"' an indi- 


California 


vidual’s position on the continua could be used to 
evaluate the effects of curricular modification. 

In this paper we will describe efforts to con- 
struct six scales insucha fashion that scale scores 
obtained by individuals will be valid indexes of 
their relative positions on the six continua. 


Construction of the Scales 


A large number of short propositions express- 
ing preference for one of two alternatives were 
written. The content of each alternative was de- 
signed to represent one pole of one continuum, and 
was paired, in the proposition, withan alternative 
relatively approximating the opposite pole, e.g., 
prudent vs. theoretic. From this fund of proposi- 
tions, thirty were selected for each of the six 
scales, making 180 items in all for thesix scales. 
The following criteria governed the editing and se- 
lection of items: 


1. Theoretical relevance of content 

2. Clarity of meaning 

3. Appropriateness of vocabulary and content 
for high school students 

4. Balance of proportion of *'positive" ув, 
“negative” items (one pole of each continu- 
um was arbitrarily designated positive or 
favorable; hence disagreeing with a negative 
item would be a positive response) 

5. Avoidance of dyslogistic phraseology or al- 
ternatives counter to cultural universals. 

6. Avoidance of factual items, responses to 
which might not reflect attitudes. 


These criteria are comparable to those suggested 
in current literature? on attitude scales except that 
each item contains two ideas, usually explicitly 
stated, though sometimes implicit, to be com- 
pared. It would seem, however, that this greater 
grammatical complexity is offset by the simplicity 
of decision in such comparison. 


* The research reported herein was performed pursuant to a contract with the United States Office of 
Education, Department of Health, Education and Welfare. 


**AI] other footnotes will be found at end of article. 
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FIGURE 1 


SCHEMATIZATION OF CONTINUA RESULTING FROM CONC URRENCES OF 
VARYING COMBINATIONS OF TWO DIMENSIONS 
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The 180 selected items were randomized with 
the provision that no two items from the same 
scale should be adjacent. The mimeographed ‘‘In- 
ventory of Choices" was administered to two 
groups of individuals: 92 high-school students in 
a required biology course, composed of four sec- 
tions, roughly stratified by the school according 
to IQ and prior achievement; and 50 undergradu- 
ate university students in two sections of an edu- 
cation course. Responses were indicated on a 
Likert-type six-point scale: “strongly agree, ’’ 
‘moderately agree, ” “slightly agree, ” “slightly 
disagree, ’’ ‘‘moderately disagree, ’’ and ‘‘strong- 
ly disagree." 

Response categories were assigned the arbi- 
trary weights, 5-0, with strongly agree to posi- 
tively phrased items, or strongly disagree to neg- 
atively phrased items, being assigned the weight 
5, etc. Where a respondent couldnot make a de- 
cision, he was instructed tocross out the number 
of that item. Such infrequent responses were as- 
signed the intermediate weight, 22. Scores on 
each scale for each individual were obtained by 
summing these arbitrary weights. 

Following Louis Guttman's “С ог nell tech- 
nique, 4 twelve tables werelaidout, one for each 
scale for each of the two samples separately. Each 
table was constructed with one column for each of 
six response categories of each item, and one row 
for eachindividual in the sample. Thus, since each 
scale had thirty items and each item contained six 
response categories, each table consisted of 180 
columns and as many rows as individuals in the 
sample. The individuals were ordered, high to 
low, for each scale, according to their summated 
scores, and listed vertically down the left side of 
each table. The responses of each individual to 
each item were indicatedby X’s inthe appropriate 
cells, the resulting tables containing all of the 
data gathered. 

The array of responses to each item was double- 
dichotomized: vertically by combining response 
categories (e.g., 5-3 considered positive, 22-0 
considered negative); horizontally at the point 
where the frequency of positive responses was 
greater above the line, and negative responses 
greater below the line. The responses in each of 
the four resulting quadrants were then tabulated 
in 2 x 2 contingency tables. The association be- 
tween high scores (dependent on positive responses 
to a majority of items) and positive responses to 
each item, was tested by the chi-squared test cor- 
rected for continuity. Two values of chi-squared 
were thus obtained for each item—one value from 
each sample, providing replicationfor judging the 
reliability of the association in differing groups. 
Figure 2 illustrates these contingency tables. 

All of the associations in the contingency tables 
were “ровіНуе,” the values of chi-squared ex- 
ceeding the . 05 level of significance inthe major- 
ity of cases. Examination of the array of ге- 


sponses to each item in the original tables, and 
their summarization in the contigency tables, 
points to the following observations: 1) while there 
was considerable **higgledy-pigglediness'"9 in the 
response patterns, the central tendency of the re- 
sponses to each item lay on a diagonal from upper 
left (high scores and high category values) to lower 
right (low scores and lowcategory values); 2) the 
confirmation of the **prediction" of anindividual's 
response to each item from his responses to all 
other items confirms (a) the direction of the a pri- 
ori weights originally assigned to the response 
‘categories of each item"; and (b) by the same token, 
the existence of a communality of content through 
all the items. 

Twelve items were selected from the thirty of 
each scale on the basis of the following criteria: 


1. The level of discriminationin the high-school 
group indicated by the value of chi-squared, 

2. The reliability of the discrimination indi- 
cated by the replication with the university 
group, 

3. The variation in the distribution of item mar- 
ginal totals, indicating the ‘‘popularity’’ of 
positive responses, in order to provide sev- 
eral ‘‘cutting points’’ in the continua. 


The twelve selected items from each of the six 
scales were then again randomized to construct a 
revised ‘‘Inventory of Choices," composed of 
seventy-two items. 


Conditions for the **Measurement'' of Attitudes 


In a uni-dimensional scale, ‘‘... persons who 
answer a given question favorably all have higher 
ranks on the scale than persons who answer the 
same question unfavorably."9 Alternativ el у, ап 
individual who has а higher scale score than an- 
other, has given positive responses to all of the 
items which were positively responded to by the 
latter, plus one or more additional items. This 
requirement gives rise to Louis Guttman’s stipu- 
lation that the responses to each item shall be a 
simple function of the persons’ ranks:10 thatthe 
responses to each item shall be reproducible from 
the scale score. If the items are arranged in or- 
der of popularity, aperson with a scale score of 3 
will have given positive responses to the three 
most popular items, etc. Thus, if we are to say 
that one individual is more “ргийеп!” than another, 
we should mean that the former has given prudent 
responses to all the items responded prudentlyto 
by the latter, and more. If this condition be ob- 
tained, an “огдша1” or “гапк order" scale would 
be constructed, 11 
° With a perfectly cumulative scale, the fre- 
quencies in the upper-negative and lower-positive 
quadrants of the 2 x 2 contingency tables, such as 
are illustrated in Figure 2, would be zero. No 
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persons with scale scores above a certain point 
would give negative responses to an item, nor 
would any persons with scale scores below that 
point give positive responses to the item. Scores 
obtained with the use of such perfectscales would 
include no ‘‘errors.’’ Designating obverse re- 
sponses as ‘‘errors’’ does not mean that one could 
confront the respondent with his “еггог” and tell 
him that he was mistaken in his attitude—'' You 
didn’t really mean this.’’ Each response must be 
assumed to define the respondent’s attitude toward 
the content of the item. Hence these “еггог5” 
are departures of the data from the model of uni- 
dimensionality. They indicate multi-dimension- 
ality in the content of the items. Confronted with 
the complexity of an itemas a whole, the respond- 
ent’s decision is based on the factors specific to 
each item—not necessarily on the factor common 
to all the items of a scale. 

If two or three major dimensions exist ina 
scale then the interpretation of a given scale score 
is ambiguous since one respondent may have at- 
tained his score by positive responses toitems of 
one dimension, while another respondent attained 
the same score by positive responses to items 
representing another dimension. For example, if 
income and education are combined as an index of 
socio-economic status, the meaning of a given 
rank is indeterminate since it may have been at- 
tained by either income or education, and these 
need not be interchangeablefor the valid predic- 
tion of an external variable. 12 

On the other hand, if each item has specific 
factors, drawing idiosyncratic responses, but all 
of the items of a scale share a single common 
factor, then the rank order established by the 
group of items should rank individuals approxi- 
mately in accord with the ‘‘ideal’’ ranking based 
on the common factor alone—specific factors 
more or less cancelling one another. 

Since, inthe formulation of the items for these 
scales, a deliberate effort was made tohave each 
item represent a continuum inadifferent context, 
each item is obviously multi-dimensional. “Ег- 
rors’’ are necessarily to beexpected. But by ex- 
amining the pattern of responses we should be 
able to make a preliminary judgment as to wheth- 
er we have a single dominant factor and numerous 
specific factors, in which case the ordering of in- 
dividuals would rest essentially on the dominant 
factor; or whether two or three major dimensions 
exist in the scale. 

If a large group of individuals who have high 
scale scores—having given positive responses to 
a majority of items—give negative responses to 
one or two items, while a group of individuals 
with lower scale scores give positive responses 
to the same one or two items; then those items, 
having blocked groups of errors, represent a dif- 
ferent dimension from the rest of the scale. On 
the other hand, if the frequency of negative re- 


sponses to each item increases continuous- 
ly as one descends the order of scale scores, 
then each item participates in a communality of 
content, and contributes to the discrimination 
of ranks in the same dimension. 1? 

Were items representing a different dimension 
obversely related to the restof the items in a lin- 
ear fashion throughout the range of scale scores, 
so that the higher the scale score the greater the 
frequency of negative responses, then the obverse 
association in the conde nsed contingency tables, 
or the diminished value of chi-squared, would re- 
veal this without a detailed examination of the pat- 
tern of responses through the range of scale scores, 
However, if such an item were positively related 
to the rest of the scale through thegreater part of 
the range of scale scores, butobversely related in 
some limited range—as if, for example, those 
most strongly ‘‘prudent,’’ along with those who are 
‘‘immediate, ’’ as indicated by total scores, should 
give ‘‘immediate’’ responses to one item, while 
those who are moderately “ргидепі” should give 
the ‘‘prudent’’ responses—the condensation of 
these responses into a 2X 2 table would obscure 
this curvilinear relationship, while a significant 
value of chi-squared might still be obtained. 

We shall be concerned, inthe preliminary anal- 
ysis of the scales, therefore, not only with the as- 
sociation of high scale scores with positive re- 
sponses to each item, to determine whether each 
item of the scale contributes to the discrimination 
of ranks in the same dimension; but also with the 
pattern of responses to determine whether this es- 
sential uni-dimensionality obtains throughout the 
range of scale scores. 

Afterwards, if the scales prove to be essential- 
ly uni-dimensional, even though errors may be 
numerous, then the cancellation of specific factors 
may be enhanced, and an approximately cumula- 
tive scale constructed, by grouping equivalently 
popular items and defining positive responses to a 
majority of each group of items as a positive re- 
sponse to the compound item. By the use of this 
‘“H-technique’’ of Samuel Stouffer, 14 the dominant 
factor is brought out more clearly by making al- 
lowance for some idiosyncratic variation. This 
reduces the number of errors, thus enhancing the 
possibility of reproducing scores from ranks and 
vice versa, but it also reduces the number of pos- 
sible ranks to which an individual may be assigned. 
In a sense, the technique helps {һе investigator 
use an instrument within its powers of resolution. 


Preliminary Analysis of the Revised Scales 


The revised inventory of seventy-two items, 
consisting of the twelve selected items for each of 
the six scales, was administered to 325 high- 
school students in college-preparatory chemistry 
and physics classes, 

The responses were again tabulated into six 


tables, using the ‘‘Cornell technique” as desc ribed 
above. The matrix of responses to each item was 
double-dichotomized and tabulated in 2 x 2 contin- 
gency tables. The association of high scale scores 
with positive responses was significant beyond the 
„001 level for each item. The values of chi- 
Squared are reported in the last row of Tables I- 
VI for the items of each scale. 

Each individual was re-scored, assigning the 
weights 1 and 0 to the new categories formed by 
the vertical dichotomization. This resulted in a 
possible range of scale scores from 12 (positive 
responses to all items) to 0 (negative responses 
to all items). The number of positive responses 
to each item given by the ind ividuals with each 
scale score was then tabulated. These are con- 
verted into proportions and reported in the body 
of Tables I-VI. (The ultimate positive frequencies 
can be recovered from the product of the propor- 
tion and the frequency at each scale score: р: Го; 
and the negative frequencies from (1 - p): fo.) 

It can be seen from inspection that the propor- 
tions of positive res ponses to each item form an 
approximately continuous gradient, such that the 
lower thescale score, the smaller the proportion 
of positive responses, subject to sampling fluctu- 


ations, 15 that each item 


ga mation of ranks by the 


items would not be 
riminations. 


Re-Scoring by the '"' H- Technique" 
а РОМИ ОЦЕ 


In order to minimize the reflection in the scale 
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itive. At Ше same time, about 25 percent of the 
respondents should give positive responses to 
three or four of the elements of compound item A, 
about 50 percent should give positive responses to 
three or four of the elements of compound item B. 
and about 75 percent should give positive re- 
sponses to three or four of the elements of com- 
pound item C. In this way, the el e ments of each 
compound item are as equivalent as possible, and 
the resulting compound items are as different as 
possible from eachother, **equivalence" and “ай- 
ference” being defined by the population of ге- 
spondents rather than by a priori judgment. This 
procedure is illustrated by the data from the pru- 
dent-theoretic scale in Table VII. 

А positive response to each of the compound 
items is defined by a modal positive response to 


the elements of that item. In other words, posi- 


tive responses to all four, or to three out of the 
four, elements of a compound item are scored as 
a positive response to the compound item; negative 
responses to two, three, or four of the four ele- 
ments are scored as a negative response to the 
compound item. 

The 325 individuals in the high- school group 
were then re-scored according to their modal ге- 
sponses to Ше compound items on each scale, The 
frequency of each possible pattern of response 10 


the compound items of the six scales is shown in 
Table VIII. 


Analysis of the Modified Scales 
— е пейосаез 


It can be seen that the patterns of reg ponses 
given by the majority of individuals were cumula- 
tive. If they gave a positive response to the least 
popular item, А, they also gave 
to the other two items; if they gave a positive re- 


» they also gave a negative re- 
Sponse to A and B 


to B, they also gave a neg 
For these ''Scale-types > 


» 19 gave the positive re- 
је toitem C; 12 + 
А. 


he frequencies of 
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TABLE УШ 


FREQUENCY OF RESPONSES OF EACH PATTERN TO THE ITEMS OF EACH SCALE 


Response Frequency in Each Scale 
Score ABO P-T P-I P-A T-I T-A A-I 
3 — 61 42 40 48 42 49 
2 - + + 88 103 62 87 104 78 
* + + 11 15 19 15 20 15 
ж + - 0 4 3 1 3 5 
$ - -+ 79 107 119 73 83 107 
*-+- 12 17 15 12 18 12 
*+-- 2 2 4 5 6 7 
0 === 72 35 63 84 49 52 


*Non-cumulative response patterns, classed as errors. 


rank are recorded in the condensed scalograms of 
Tables IX-XIV. For item A of each scale positive 
responses by individuals having scores of 2-0 are 
“errors’’; for item B, negative responses by in- 
dividuals having a score of2 and positive re- 
sponses by individuals having a score of 1, are 
“@rrors,” ete, 

A coefficient of reproducibility is the propor- 
tion of responses which fit the cumulative patterns, 
A coefficient of reproducibility of .92 means that 
from an individual’s scalescore we can reproduce 
his responses to each item with 92 percent accu- 
racy (by predicting the cumulative pattern of re- 
sponses). The coefficients of reproducibility for 
each scale are reported in the “total’’ column of 


$ separately. These 
coefficients are seento rangefrom .90-.95, which 


values are within the bounds established by Gutt- 
man for adequate reproducibility. 17 
Thus, within the lim 


е items. It is possible, 
with 90-95 percent accuracy, to say, for example, 


that a more ‘‘prudent”’ individual has made pru- 
dent responses to all items responded to prudent- 
ly by a less prudent individual, and more, 

The coefficient of reproducibility descr ibes 
the confidence with which we may assume that a 
given scale score represents a particular cumula- 
tive pattern of responses. But it does по describe 


the confidence with which we may reject the ро5- 
sibility that the cumulative pattern of responses 
might have been Obtained by chance were the items 
actually independent rather than cumulatively in- 
terdependent. 

For example, with three independent items, if 
3/4 answer the most popular item, C, positively, 
1/2 answer the next most popular item, B, posi- 
tively, and 1/4 answer the least popular item, А, 
positively; then 1/2 of 3/4 = 3/8 would answer both 
B and C positively by chance, if the items had no 
relationship, and 1/4 of 3/8 = 3/32 would answer 
all three positively by chance, etc. Thus, the 
expectation for each response pattern for inde- 
pendent items can be computed from the product 
of the proportions giving each response. 

If the items are strictly cumulative, however, 
then all those who give positive responses to A will 
also give positive responses {0 В апа С. Thus 1/4 
or 8/32, rather than 3/32, will give positive re- 
Sponses to all three items; none will give non-cu- 
mulative response patterns, etc. 

The expected proportions obtained from these 
hypothetical dis tributions under the two assump- 
tions of 1) independence of items, the null hypoth- 
esis, Ho, and 2) cumulative interdependence of 
items, Hc, are tabulated in Figure 3. Comparing 
the proportions under Hg and He we can predict 
the direction, or sign, of differences to be antici- 
pated between the expected frequencies under Ho 
and actually observed frequencies, if the direction 
of departure from independence is toward cumula- 
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TABLE IX 


PRUDENT-THEORETIC SCALE 


FREQUENCY OF POSITIVE AND NEGATIVE RESPONSES TO EACH ITEM OF THE PRUDENT- 
THEORETIC SCALE BY THE INDIVIDUALS OF EACH SCALE SCORE 


Items 
A B с 
Еге- Pos- Neg- Pos- Neg- Pos- Neg- 

Score quency itive ative itive ative itive ative Total 

3 61 61 0 61 0 61 0 183 

2 99 11 88 88 11 99 0 297 

1 93 2 91 12 81 79 14 279 

0 72 0 72 72 0 72 216 
Total 325 14 251 164 239 86 975 
Errors 13 0 11 0 14 50 
CofR .96 .93 .96 .95 


tive interdependence. These signs аге listed іп 
the last column of Figure 3 for the proportions 
used Неге as an example. 

In Tables XV-XX the chi-squared test has been 
applied to the goodness of fit of the obtained fre- 
quencies to the expected frequencies under the as- 
sumption of independence, Но, for each of the six 
scales. In each case the probability of obtaining 
the observed distribution, were the items actually 
independent, was less than .001. Therefore, we 
can reject the possibility that the items are inde- 
pendent with the confidence that we should be wrong 
in making such decisions fewer than one time in a 
thousand. 

Moreover, thedirection of the interdependence 
is substantially in the direction predicted under 
the assumption of cumulative interdependence. 
The contributions to the value of chi-squared of 
differences which are not in the predicted direc- 
tion have been segregated into the last column of 
each of the tables. They can be seen to be negli- 
gible—in no case affecting the level of significance 
of the value of chi-squared. 


Summary 


It was the purpose of this res earch project to 
construct an instrument whereby individuals might 


be validly ordered along six theoretical continua. 
It was determined that ‘‘order’’ should be assumed 
to exist only if individuals' decision behavior 
evinced a particular cumulative structure, so that 
a person at any given position on the continuum 
would make the ‘‘favorable’’ decisions made by all 
persons ‘‘lower’’ on the continuum, plus one or 
more additional ‘‘favorable’’ decisions. 

Because of the complexity of considerations in- 
volved in the process of making decisions in the 
variety of contexts offered as “items, ” there was 
considerable ‘‘higgledy-pigglediness’’ in the pat- 
terns of the responses; the many discriminations 
being based on differences іп a large number of 
factors, rather than cumulative differences in a 
uni-dimensional factor. 

By a process of repeated consolidation of re- 
sponses: first, by dichotomizing the original six 
responses into positive and negative responses, 
and then by combining items, and dichotomizing 
the responses to the elements into positive and 
negative responses to the compound items, it was 
found possible to elim inate most of the discrim- 
inations based upon idiosyncratic factors. 


The remaining discriminations coarsely ranked 
the individuals in a cumulative order, so that one 
might say that an individual with a given scale 
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FIGURE 3 


A PRIORI EXPECTED РВОРО 


RTIONS UNDER THE ALTERNATIVE HYPOTHESES 
OF INDEPENDENT ATT 


RIBUTES AND OF CUMULATIVE ATTRIBUTES 


Predicted 
Response Expected Proportion Expected Proportion Sign of 
Pattern 0 Не Difference 
= 113 _ 3 21.8 
ыды ЖЕЗ» наста "и ^ "qwe $ 
313 9 1 8 
- + + «.B.C = о. = L Ж = 26 == Б 
24 32 эн 4 32 
113 3 
„=й Е „ыш 0 ы 
+ + - АВ? = 1111 _ 
424 32 9 
813 9 
--+ a-B-C = а.ш = > = = 1. 8 = 
8 424 32 мал LI 
311 3 
-+- -В. “ЭХЭ Ed „Ж. 
авай: 424732 0 E 
111 1 
Жо. А-В- В таи EN 
ВЕ 424 = 32 9 - 
ЗЭР T = Sid 8 - 1. 8 
ЧАКЕ «лака а, + 


Note: A indicates the 
sponses to B; С 
В, the negative 


Proportion of positive responses to item A 
› the positive responses to C; а. 
responses to B; 2, the negative 


; B, the positive re- 
; the negative responses to A; 
responses to С. 
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TABLE X 
PRUDENT-IMMEDIATE SCALE 


FREQUENCY OF POSITIVE AND NEGATIVE RESPONSES TO EACH ITEM OF THE PRUDENT- 
IMMEDIATE SCALE BY THE INDIVIDUALS OF EACH SCALE SCORE 


Items 
A B с 
Еге- Pos- Neg- Pos- Neg- Pos- Neg- 

Score quency itive ative itive ative itive ative Total 

3 42 42 0 42 0 42 0 126 

2 122 19 103 107 15 118 4 366 

1 126 2 124 17 109 107 19 378 

0 35 0 35 0 35 0 35 105 
Total 325 63 262 166 159 267 58 975 
Errors 21 0 17 15 0 23 76 
CofR .94 .90 .93 . 92 

TABLE ХЛ 


PRUDENT-AESTHETIC SCALE 


FREQUENCY OF POSITIVE AND NEGATIVE RESPONSES TO EACH ITEM OF THE PRUDENT- 
AESTHETIC SCALE BY THE INDIVIDUALS OF EACH SCALE SCORE 


Items 
A B с 
Еге- Pos- Neg- Pos- Neg- Pos- Neg- 
Score quency itive ative itive ative itive ative Total 
3 40 40 0 40 0 40 0 120 
2 84 22 62 65 19 81 3 252 
1 138 4 134 15 123 119 19 414 
0 63 0 63 0 63 0 63 189 
Total 325 66 259 120 205 240 85 975 
Errors 26 0 15 19 0 22 82 


CofR .92 .90 293 292 
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TABLE XII 
THEORETIC-IMMEDIATE SCALE 


TIC- 
SES TO EACH ITEM OF THE THEORE 

OF POSITIVE AND NEGATIVE RESPON 

as 5 SCALE BY THE INDIVIDUALS OF EACH SCALE SCORE 


к س‎ — M —À—ÀMÀ ци 


Items 
А В С 
—— "i ТРЗИН. ВН PM 
= е - Neg- 
Fre- Pos- Neg- Pos Neg Pos 
Score quency itive ative itive ative itive ative Total 
3 48 48 0 48 0 48 0 144 
2 103 16 87 88 15 102 1 309 
1 90 5 85 12 78 73 17 270 
0 84 0 84 0 84 0 84 252 
Errors 21 0 12 15 0 18 66 
TABLE XIII 


THEORETIC- ДЕ STHETIC SCALE 


FREQUENCY OF POSITIVE AND NEGATIVE RESPONSES TO EACH ITEM OF THE THEORETIC- 
AESTHETIC SCALE BY THE INDIVIDUALS OF ЕАС 


H SCALE SCORE 
DONE eee 


A B С 
өв ИЩИ ымы Жэ — NM 
Fre- Pos- Neg- Pos- Neg- Pos- Neg- 
Score quency itive ative itive ative itive ative Total 
3 42 42 0 42 0 42 0 126 
2 121 23 104 107 20 124 


21 


- WILSON 


EWARDS 


26” 56” сб" 06" ¥ joo 
8), vc 0 ст 61 0 LZ 610114 
9,6 9), 675 181 PPI 5244 91, 965 TeIOL 
9Ч1 GS 0 eG 0 09 0 GS 0 
855 61 LOT PIT СТ 218! L 981 I 
766 G £6 СТ 68 8L 02 86 G 
171 0 6? 0 6? 0 6h 6? 6 

TEOL олце SAT} эле әлти эл Це IAI Áouonb 91006 
-Зем -SOd -дом - год -daN -504 -914 
2 я ү 
5193] 


_ —— ——— ————=—-———-——— ————— 


ЧЧОО5 ATVOS HƏVA AO S'IVOGIAIGNI AHL АЧ ATVOS ALVIAAWNI 
-OLLWHLSSV AHL AO WALI HOVE OL SSSNOdS3H ялиуоям ANV AALLISOd AO AONANOAYA 


ЯТУО5 @LVICGAWWI-OLLGHLSAV 


AIX яЯПчу 


JOURNAL OF EXPERIMENTAL EDUCATION 


22 


"5149318 әѕиойѕәл aAneqmumo-uoNx 


uey} 5591 43114290214 


100” 
v ر‎ шорээлд Jo бәә1дәс 
0 19989 7821 70000 `0 96666 ‘PGE TAS Te}OL 
66681 РИ ССУВУ "86% + 446 19 "ЕЕ GL "= 
96982 "9 71188 А” " 71188 `6 G жж. жа 
22616 51 89206 708: ы 89206706 eI на гээн 
ОРТ С 89271 'Р1- е 89271 "66 6L oe за 
65004 6 66800176 - е 6002 `6 0 айлын 
64998 6 62097 '9T- ч 68099712 IT dé ® ж 
2660170 5886778 - 4 8886Р 16 88 кре 
80186 СУ P6IPO'P£* * 90866 792 19 + + + 
S92U9.I9JJIQ 9) дочелета оопџолајпа | Аоџопрола Хоцэгрэлд илепеа 
рафотрелчп „(РР Ур реитедо Jo изт разое9хя рэллэ540 asuodsay 
рејотрела (0H)9J "i 


Jo поцпдъцчод 
OOOO МЫ 


яомяамяаяамі AO NOLLAWNASSV AHL няамп вятомяпояня аятонахя 
ANY зятомяпояня GANIVLEO МЯЯЛЫ,ЯЯ 5Я1ОМУ4ЯНО81а JO AONVOIINOIS AHL ЯО LSAL аянуповз-тно 


ятуоз ощячочно лмчапна 


АХ ЧЛЯУТ 


23 


EDWARDS - WILSON 


“сдиләуура әѕиойѕәл еадцешшпо-чом» 


100“ uey} 5591 А11484044 
р шорээлд то зеелзга 
5610070 68500 "96 60000 “0 66666 ‘eS 506 110/1, 
НННП ЭРЭР Элэг” Энэ 
101699 80821 217 т 06918 “сс СЕ алийн 
99166 ‘с с%00675- е 9%00676 6 di d 
1686 1 66188 '9 - 4 66188 752 LT массын. 
6620070 5610070 1%9697Т + 4 69205 “SOT LOT E 
64866 °0 ТОСРА T = 4 19071 '9 ? ий. 
669006 Р LOTZE OL” 4 LOTZE ‘G6 GI ono 
$0857 '0 PS6E6°9 - 5 73656 "601 СОТ же = 
6У691:6 919984 91" т P8SEP 92 о? DON 
сдопчелета 91 ооџолајпд  9орочелојла — Аоџопбола Коџопбола птепеа 
разотрелдип 2(91-91) реашезао Jo 1316 рэзээахч реллезао asuodsay 
Jo чоцпат анод рэзэтрэлд (0н)9] Oy 


FONAGNAYdACGNI AO NOLLGWNSSV AHL ччакп зятомяпояня ая очахя 
ANY вятомяпояня CANIVLEO МЯЯАЛ,ЯЧ SSIONVdOHOSIG JO чому 91414915 AHL AO LSAL аЯНУПОЅ-ІНО 


ATVOS &ALVIGHWWI-LNAGNUd 


IAX я19у41 


JOURNAL OF EXPERIMENTAL EDUCATION 


24 


‘suza}jed әѕЅиойѕәл олцешпшпо-чом» 


———— ЕООД ee aa > ت‎ endl 
100” ИРЦ) 5591 Аппаваола 

ee ИН DCP E DE a EAGER AS ae aes 
Ӯ шорээлд Jo $ээ13эа 

— — ET 
го, 


0 SEPET "ес 70000 `0 96666 7%26 сев 
келе СФ ээ гм л т ЕЕЕ 
69819 '6 19515 085 + $8451 "С? £9 - - - 
PGAGE р 20888 '9 - - 70888 ‘OI $ = + ж 
1200 $ 61110 OT- : 21110752 GT = Б 
75550 `O 68159 1 - s 68159 "ост 611 шал 
88881 1 69818 8 - в 6PELE `9 с = + = 
$538? Ф CLOPL TTI- ' 2,251,706 61 Де mode چ‎ 
8005071 9967978 - : #9619 “OL 29 deem 
1,806 `9@ ,2%00 722% ф £4966 4I OP 8-4-4 
pejorpexdun JUI AD рештејао Jo 81$ pajoadx релтезао asuodsay 
роејотрела (Оң)әу Or 


Jo шоцпаг шоо) 


зокчамчачамт JO NOLLdWASSV AHL чяамп SHIONSAO5)4 аятояахя 
ЯМУ зятомяпочня AANIV LAO МЯЯМІ ЯЯ SSIONVd3HOSIG AO ЧОМУО1А1МЭ16 AHL ЯО 15841. аЯНУП05-1НО 


J'IVOS OLLAHLSAYV -1мяапна 


WAX N'IS V.L 


25 


EDWARDS - WILSON 


"6Цл91164 әсиойбвәл әлт}®Епшпо2-О}{* 


100“ uey} 5591 Аппдедола 
? шорээлд Jo зеетзга 
61719 `0 19055 "ВОТ 60000 °0 96666 ‘PGE 606 TEOL 
РРТТО “LE 60875 “OFT + 16991 "ер 78 қт 
68616 6 486619 - Т 88661 11 6 = = л 
7$ св “OT 04489 'v6- В 01188 796 СТ Mi da. 
7969$ '9 69799 700- - 69799 "56 £L ғ“. 
06296 `L 0519878 - " 0819876 Т "теза 
620167? 09781 "017 - 09784 "СС ST ка 
6ІРІ970 6І?І970 52600 °L + 4 81066 '6L 18 ن‎ 
ој да СЕ 00077 "90% T 00099 ‘TG 8? жей 
сдопетета EL. ИИ | оополојла зопелета Коџепрбола Коџопрола илепеа 
разотретдчп 2(9:-97) рәшъејао Jo и215 рэзээахя рәлтә5до asuodsay 
Jo чоцпдъцчо3 рәлотрәса (05)91 01 


помзамзазач AO NOLLGINNSSV AHL чяамп зятокяпояня азі очахя 
ачу зятокяпбяня азмутао МЯЯАЛ,ЯЧ зятокуаянобта AO AONVOLMAINDIS AHL AO 1,584, CAUVNOS-IHO 


ЯТУО5 ALVIGAWII- OLLWHOSHIL 


ШАХ ята, 


JOURNAL OF EXPERIMENTAL EDUCATION 


26 


“6ШХө1үЕй әѕиойѕәл aAnemumo-uoNx 

но ИРЦИ cer И лы А 
100” ИРЦ) 5591 атпагаота 

и Ни 
p шоргэлд то $ээ:3эа 

eee 
965 TEOL 


62091 '0 1696Р 766 70000 '0 96666 ‘FEE 
ی ی وو ی ری یھ م یھ س ا زیی‎ С^ лык gta сы сы san 
93750 РТ 96621 00% + P0928 "Ве 6h "= 
02189 `0 Р91,072- : 7917208 9 ше жи 
999ЕТ 5 88029 '21- ё 88025706 81 > = 
да да! 60109 'II- к 80109 "56 £8 ج‎ 
75988 с 2916676 = ы. CPIES "9 с dI 
78045 1 10999 - : СРР 92 02 .... 
62091 "0 66091 '0 96800 °F + s 0966 '66 РОТ oe = 
59090 1 98890515 5 59186 7,2 А; + + + 
ی‎ н _ а(97-95) рәптездо Jo и915 рејозах gq pearasqo asuodsey 
ројотрола (OH)? ; Ој 


Jo uornqrjuo2 
کک ن د ڪڪ ج جج ججج ججج ج ج ج ڪڪ‎ 


токчакчачамт JO мошайшйзву AHL чяамп $ятомяпьння аяъочахя 
ANY 5ЗІОМЯПОЯЯЯ AANIV LAO мяял\1 ЯЯ зятомуаянозта AO dONVOLIINOIS JHL ЯО ISAL ачнупо$-тно 


ATVOS OLLAHLSAV - OLL AUOHHL 


XIX наду 


21 


EDWARDS - WILSON 


6Ц1911Ей4 әѕЅиойѕәл елпетшпо-чом» 


100” uey} 5591 AITIqeqoig 
? шорээлд 10 $ээ18эа 
6640070 09028 ‘TS 5000070 96666 ‘PEE TAs 301, 
Цэг» ППИ ИН و‎ nt НИ 
61218 'TI 99118 617 Ж ?680% СЕ 28 didi 
62878 `0 O8L68°¢ - и 08:68 "6 1 e wa 
18086 7), 7566 217 ы 76661 82 СТ и» шщ 
66600 `0 95600 `0 09795 0 + - 05675 901 LOT M EUR 
66670 1 67748 ‘с - 1 6Р?,87), с - ++, 
1199676 45:14 'LT- 4 УЕВОР СЕ СТ бәз, 
1660870 0892079 - - 08966 ре 8L eem 
52598702 99005 “Eat T 75661 82 бр + => 
зээцэхазиа 94 зополејла | гочелета Adouanbeag Коџапђола | млецеа 
розотрелдић (93-93) рәшъзао Jo u3tg рэзээ х 4 рәлләва) | esuodsew 
10 џчоцпатајио 5 резотрета (0ң)9: OT 


чомчакчачамт AO NOLLGINNSSV AHL ччама вятомяпдояня ачлочахч 
ЯМУ вятокяпбяня AANIV LAO МЯЯМІ ЯЯ SHSIONVdSHOSIG AO AONVOIAINDIS AHL ЯО ISAL CAYVYNOS-IHD 


ATVOS ALVIGHNWI-OLLAHLSAV 


ХХ GTAVL 


28 


JOURNAL OF EXPERIMENTAL EDUCATION 


Ї the ‘‘favorable’’ responses of 
75 co эш lower scale scores, plus 217 
more additional ЦАРЕ. а bp AA 

гасу. wa A 
г me к е м” confidence (p<.001), the 
possibility that these cumulative patterns of re- 
sponse could have arisen with such frequency by 
chance were the items independent. 


The instrument has validity, based upon its in- 


ternal cumulative structure, for the rank ordering 
of individuals along the six continua, which should 


be of value in relating these variables with any 
external variables. 


2. For descriptive €xpansion of thes 


3. E.g., Allen L. Edwards. 


4. Louis Guttman. “Тһе Cornel] Tech 


· A. Walker. ‘‘Answer-Patte 


. Ideally each “prediction?” 


. Cf. Shirley A. Star. “Тһе Scr 


FOOTNOTES 


1. T. Bentley Edwards and Alan B. Wilson. “At- 


titudes Toward the Study of School Sub - 


jects,” Educational Theory, VIII (October 
1958), pp. 275-84. 


e vectors 


and their ‘‘ideal types" see Edwards and 


Wilson, ibid. 


Techniques of At- 
titude Scale Constructi 


pleton-C entury-Crofts, ines 1 957), pp. 


13-14; Katherine Gordon C арт, “Тһе 
Questionnaire and O 


as Aids in Field E 
Social Surveys and Research, ed., by Paul- 


ine V. Young (Englewood Cliffs: Prentice- 
Hall, 1956), pp. 184-91. 


nique for 
Scale and Intensity Analysis, ” Educational 
and Psychological Me asur 


ements, VII 
(Summer 1947), рр. 247-79, 


rn and Score- 
Scatter in Tests and Examinations, “ Brits 


ish Journal of Psychology, XXII (July 1931), 
pp. 73-86. 


Should be based on 
core excluding the 
m being predicted. 
ence resulting from 
predicted” score in 
ot Substantially alter 
€ the number of items 


the individual’s total s 


the total score does n 
its significance wher 
is large. 


еепіпр of Psy. 
choneurotics іп the Army: Technical DS. 
velopment of Tests," Studies in Social 
Psychology in World War 1, Vol. TV, Mens 
urement and Prediction, ed., by байгаа, 


Stouffer (Princeton: Princeton University 


11. John Dewe 


Press, 1950), р. 494: “... in no case = 
the scale or quasi-scale scoring of an a 
prove to be the reverse of the empirica 2 
found differences оп that item. This out- 
come indicated, first, that our a sie 
judgment had been correct апа, secon › 
that the scale scoring, even on a зэргэ 
mous basis, preserved the решени effi 

сїепсу of the individual items.... 


8. Since the phi-coefficient is direc tly propor- 


tional to chi-squared when N is ایت‎ 
this basis of selection is analogous tO me 
recommended by Allen L. Edwards and 1 
P. Kilpatrick, “А Technique for the pet 
Struction of Attitude Scale S,"' Journa 3j 
Applied Psychology, XXXII (1948), 374-84, 
except that observed marginal totals rat e 
than Thurstone scale values were used е 
obtain a range of items. Actually this la : 
criterion was applied loosely, and, оез» 
result, many ‘‘middle’’ items were ~ 
ed, discriminating at the same level. one 
condition was corrected by the ee a 
of positive responses in the subsequent i 
provement of the scales, infra. 


е 
9. Samuel A. Stouffer. “Ал Overview of th 


99 
Contributions to Scaling and Scale TUBES 
Measurement and Prediction, Op. cit., р. * 
СГ. Jane Loevinger. “А Svo tem atic ся 
ргоасћ (0 Ше Construction and Evaluatio 
of Tests of Ability, ” Psychological ша, 
graphs, LXI (1947), Мо. 285; and ме 
Technic of Homogeneous Tests Compare 
with Some Aspects of ‘Scale Analysis’ an 
Factor Analysis, е Psychological ша, 
СУ (1848), pp. 507-20; б. Murhy, D ~ 
Мигрћу, and Т. м. Newcomb, Exper 
mental Social Psychology (New York: Нагр 
ег апа Bros., 1937 » p. 897. 


я Це 
10. Louis Guttman. “А Basis for Scaling Qua 


Е е- 
tative Data, >> American Sociological В 


с: е 
view, IX (April 1944), pp. 139-50; Th 


ди 
Basis for Scalogram Analysis, ’’ Меавиг- 
ment and Prediction 
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» Op. Cit., рр. 60-9 


у. Logic: The Theory of n 

(New York: Henry Holt and Сог, tical 

209: “A measured collection is iden 

with the kind of Collection just said to a 
© property of totality in contrast wit 


to which; (2) 
for counting; 


lation of these units until the limit ид ана. 
15 reached. The word accumulation 


- t 
here used 1пуоју ев something differen 


from the aggregation found in the merely 
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numerical set.... 

“Тһе cumulative aspect ingenuine col- 
lective propositions signifies that such 
propositions depend upon some principle 
of arrangement or order which is derived 
from the involved means-consequences re- 
ја ОН... 

Cf. also with Morris R. Cohen and 
Ernest Nagel, AnIntroduction to Logic and 
the Scientific Method (New York: Harcourt, 
Brace and Co., 1934), especially Chapter 
XV, Section 5, ““Тһе Formal Conditions 
for Measurement,’’ pp. 297-98. 


12. Qualitatively diverse elements may be oper- 


13. Cf. 


ationally equated and give rise to actuarial 
indexes which prove valid for a specific 
problem of external prediction. However, 
they are not theoretically pregnant since 
the grounds for their validity is ambiguous, 
1, ё., they are not internally valid in the 
sense of ‘‘measuring’’ what they purport 
to measure. Interpretationleads to falla- 
cious ‘‘inverse operationism.’’ Cf. Clyde 
H. Coombs, ‘‘Theory and Methods of So- 
cial Measurement, ’’ Research Methods in 
the Behavioral Sciences, ed. by Leon Fes- 
tinger and Daniel Katz (New York: Dryden 
Press, 1953), pp. 471-535. 


Louis Guttman. “Тһе Utility of Scalo- 
gram Analysis,’’ Measurement and Pre- 
diction, ор, cit., p. 159: “cxe This gra- 
dient pattern of errors indicates that, while 
there is not a single factor operating as in 
the case of a scale, nevertheless there is 
a Single dominant factor and indefinitely 
many small random factors, so that the 
prediction of any external variable must 
rest essentially on the dominant factor...” 
See also ibid. , рр. 160-63, 207, 208, 458, 
494, and 547. 


14. Samuel A. Stouffer, and others. “А Tech- 


nique for Improving Cumulative Scales, "' 
Sociological Studies in Scale Analysis, ed. 
by Matilda White Riley and others (New 
Brunswick: Rutgers University Press, 
1954), pp. 372-89. 


15. In each case where a higher proportion of the 


individuals in the lower of two adjacent 
ranks gave positive responses to an item, 
the frequency of positive and negative re- 
sponses in the two ranks was tabulated in- 
to a 2 х 2 contingency table. The null hy- 
pothesis Но, that the difference іп propor- 
tions of positive and negative responses 
between the two groups, in this reversed 
direction, could have occurred by chance 
were thereno relevant differences between 


the ranks, was tested by: 1) the chi-squared 
test where the total number of individuals 
was greater than 40; or between 20 and 40, 
provided that the expected frequency in 
each cell was greater than 5; or 2) the 
Fisher-Yates test of exact probability 
where the number was lessthan 20; or be- 
tween 20 and 40, where the expected fre- 
quency in some cell was less than 5. 

In all but two of the cases it was impos- 
sible to reject Но at the .05 level. Hence, 
the null hypothesis is tenable for those 
cases. Тһе two exceptions are shown in 
Figure 4, next page. 

Reversals of this order, which could 
have occurred approximately 2.5 times in 
a hundred by chance, occurredtwice in the 
860 pairs of adjacent ranks of the six scales. 
However, the presence of these reversals 
confirms the fact, which our interpretation 
of errors provides further grounds for af- 
firming, that the fineness of disc rimina- 
tion provided by the separate ranks are not 
valid, and may not be relevant to the com- 
mon factors in the continua. 


16. The ‘‘image’’ of each non-cumulative pattern 


is a cumulative pattern—in most instances 
of the next lower scale score. See Louis 
Guttman, ‘паре Theory of the Structure 
of Quantitative Variates,"' Psychometrika, 
XVIII (December 1953), pp. -19; and 
““Тһе Israel Alpha Technique for Scale 
Analysis," Sociological Studies in Scale 
Analysis, op.cit., pp. 410-15. 


17. The fact that assignment of frequencies to each 


of the eight response patterns can be made 
with only four degrees of freedom may not 
be immediately apparent. A three-dimen- 
sional contingency table, 2 x 2X 2, pro- 
vides the eight cells needed to house all 
possible combinations of positive and neg- 
ative responses to each of the three items. 
This is illustrated in Figure 5. 

Given the marginal totals of the fre- 
quency of positive and negative responses 
to each of the three items, then the joint- 
frequencies in only four cells may be as- 
signed freely (providedthe sum of the fre- 
quencies assigned to cells in the same 
plane do not exceed the marginal total of 
that plane), the frequencies in the remain- 
ing four cells being then determined by the 
marginal totals. 

For example, the front face of the cube 
holds the four cells, a, b, c, and d, which 
house the frequencies of the positive re- 
sponses to A. When three of these cells 
have been arbitrarily filled, using three 
degrees of freedom, the frequency in the 
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FIGURE 5 


2 x 2x 2 CONTINGENCY TABLE HOUSING THE FREQUENCIES OF THE 
JOINT OCCURRENCES AND NON-OCCURRENCES 
OF THREE ATTRIBUTES 
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fourth cell is determined by the marginal 
total of the positive responses to A. 

The right face of the cube holds the four 
cells, b, d, f, andh, which house the fre- 
quencies of the negative responses toC. 
Two of these cells, b and d, have already 
been filled; hence the arbitrary choice of 
the frequency of either, but not both, f or 
h, is possible, the other being determined 
by the total number of negative responses 
to С. This uses one more degree of free- 
dom. 

The three cells, a, b, and Í, of the up- 


per face of the cube have now been deter- 
mined, hence the frequency in cell e is de- 
termined by the total number of positive 
responses to B. Similarly, the шэн хийж 
in g is determined by the total number 5 
negative responses to B, cells c, d, and h 
having already been determined. = 

The solution above was simplified by 
first determining three cells in the same 
plane. However, it can be shown with 
simple algebra that there are only four 2 
grees of freedom regardless of the orde 


in which the individual cells are filled. 


APPENDIX 


е inventory are listed below in ord 


er of the popularity of the “positive’’ response, 
е compound items of each scale. 


Each item is preceded by: 1) the number of en 
the proportion of individuals giving “ровінуе” responses to the item, and 


esignated as positive. The following notation is used to designate responses: 


A - strongly agree 

B- moderately agree 

C - slightly agree 

D - undecided or no response 
E - slightly disagree 

F- moderately disagree 

G- strongly disagree 


Prudent- Theoretic Scale 
= theoretic Scale 


A 
.31 D-G ке Serie EH Ilike the ones with interesting scientific theories that neng 
» even if they’re not completely true ial prob” 
lems of space а у » better than those about the social p 
.38 о-с cocaine ue of the pyramids, I should be more interested in the еп” 
in the class i 
nificent display possible. HEE GU "олоту of Egypt which made such mag 
.41 A-D ши IBM machines are used to calculate insurance rates. However, data must 
е ted into the machines. The interviewing techniques for collecting data interest ше 
more than an explanation of how the machines work, " 
"42 A-E I would rather teach Science than do research, 
B 
2.50  D-G In our com lex indust i 
na p néustrial civilization а young person should Specialize early and stick 
461 C-G When I see an article about “electronic ins” 1; | ding out 
how they work than what their uses are, "SG? Tam inona манхан 
.61 A-E I should rather read 


па ins 
Michael Faraday's Experimenta] ne erstand Willi 


àm Shakespeare's Hamlet than 
Searches in Electricity. = 
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. 30 


. 43 
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I like history and civics much better than science and mathematics. 


с 


? 
1 am more interested in finding out how TV has affected people’s taste than in finding 
out how TV works. 


I am more interested in following newspaper reports on the recent discoveries regard- 
ing “negative matter’’ than on the developments in racial integration in the schools. 


I would rather study algebra than history even though algebraseems to be almost to- 
tally unrelated to any other subject. 


If I had an hour to wait for a train I should more likely read The Scientific American 
than The Atlantic Monthly. TUUS x 
Prudent-Immediate Scale 
A 


In a social studies course I would rather have the reasons why the U. S. didn't join 
the League of Nations explained to me than try to figure it out. 


I should prefer the live theater to movies if they were the same price. 


Sometimes when a fellow is out with the gang, he pretty well has to do a few things he 
knows he really shouldn't. 


Mercy killing should be legalized for cases of extreme suffering where there is no 
hope for cure. 


B 


When talking with my friends in the evening I'd rather talk about people we know and 
have fun with than talk about religion or philosophy. 


The opinion of friends helps more than reading in making up my mind. 


The foreign policy of our government should be based on high moral principles even 
though this may entail a loss of strategic power or prestige. 


I frequently think about the reasons for other people's mis behavior instead of react- 
ing with irritation. 


с 


I never worry about how things are going to work out—they usually seem to take саге 
of themselves. 


An impulsive person is warm and sincere; one who analyzes his emotions is cold ana 
“ ” 
рћопу. 


A business man should make his decisions strictly according to the interests of his 
business. He should not worry about what happens nationally to wages and prices. 


Science has definitely not been able to show that colored races are inferior to white 
races. 
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Prudent-Aesthetic Scale 
ч- га з аёас scale 


А 


Nobody should be allowed to cut down the redwood forests and turn them into lumber. 


If pay, housing, etc., wer 


е equal, I should like the work of а forest ranger better 
than that of a minister, 


I should rather take a “shop” course than a “world history” course. 


If I were a musician the thing I should like best about it would be getting across to the 
audience the basic idea of the composer. 


B 


I find paintings interesting when I am able to see how they represent the artists' atti- 
tudes toward life, 


Art should be appreciated intuitively. Analysis destroys its beauty. 


I like to watch a big house afire, 


To my way of thinking, the n 


eed to keep a cit 
tant argument in favor of sm 


у beautiful to look at is the most impor- 
Og control. 


с 


Because they need to g 


et close to life, 
they treat lightly the ti 


artists are entitled to Special consideration if 
ез of marriage, 


Theoretic-Immediate Scale 
mediate Scale 


А 


When I’m Studying math Ог Science it is refreshi i 
ng t А tching 
TV or talking with а friend. Милан шилж 


School mathematics courses should Conc?ntrate more on Practical consumer and bus- 
iness training. 


I would sooner have a big living room for Parties than have а workroom for hobbies. 
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If I were employed by a company manufacturing chemicals, I would rather stay in re- 
search than become a company executive, so long as the loss in pay was not too great. 


I prefer a science class that is run along fairly formal lines so that I can avoid the 
distractions arising from personal entanglements with other members of the class. 


с 


I enjoy working hard at a science project even if others don’t recognize my accom- 
plishment. 


I should rather be elected to the Student Council than be selected as an honor student 
in science. 


I don't like being interrupted while I'm doing laboratory experiments by friends who 
feel like talking. 


I would rather be known as the writer of a social column publishedin many papers than 
as the Director of an astronomical observatory. 
Theoretic-Aesthetic Scale 
A 
If I were interested in studying flowers, I should be attracted chiefly by the beauty of 
the flowers. Comparing the structure of different kinds of flowers would not interest 
me much. 


I prefer chess to checkers. 


When you go on an automobile trip it is much more fun to pick places to stay as you 
go along rather than writing ahead for reservations. 


I never wonder how the time is going when I’m painting a room or sawing firewood 
like I do when I'm studying math or physics. 


B 


Chemistry experiments are fun to watch so long as there are plenty of explosions and 
color changes. 


A person should throw himself into life with vitality —the scientists’ reflection on how 
things work is a wet blanket on the spontaneous pleasures of affection. 


When I was little I liked erector sets more than tops. 
It's a sloppy sailor who'll let his sails flap while he basks in the sun and breathes the 


crisp salty wetness of the air; the keen sailor watches the wind, studies thetides, and 
understands details of the rigging. 


с 
Scientists destroy much of Ше beauty of nature when they explain away its mysteries. 
The forces of nature are subjects for wonder and awe—not analysis. 
**Hotrod" racing would be fun if you didn’t have to know about and work on motors. 


When I look at the stars at night 1 sometimes meditate on the way the universe works. 
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Aesthetic-Immediate Scale 


A 


I would rather go sailing by myself than watch a football game. 


If I had the necessary athletic 


prowess I should prefer to excel in Ше cross-country 
marathon than in football. 


I like to ride alone. The feel of a good horse under me, his strength and his rhythm, 
more than make up for the lack of fellow riders. 


Abilities of sign writers are differen 
ness. If I had the ability to do eithe 
paint the signs. 


t from the abilities of men who run азып Б" 
г, I would rather learn to run the business Ша! 


B 


I spend more of my free time on hobbies like stamp collecting, woodwork, etc., than 
going to parties or entertaining friends. 

Sailing on a boat would be fun with a group of people, but I don’t think I’d care much 
for it by myself. 


Ilike Dixieland jazz better than “rock and гоц,” 
A girl should wear Sweaters and “ 


pearls” or whatever most of the girls are wearing 
rather than conspicuous hand-mad 


е jewelry. 
c 
I should prefer to be a machinist rather than a salesman. 


I enjoy Swimming in the ocean by m 


yself, or, for Safety, 
guard, more tha 


: Р ие 
with а companion ог lif 
п Swimming in a pool. 


I think I should enjoy Longfellow's poem ‘‘Evangeline’”’ more if it were told as a love 
Story in modern prose. 


When I am buying clothes I рау less attention to the ones that don't show than to the 
ones that do show, 
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STUDIES IN SOCIAL PERCEPTION: 
WORD PRODUCTIVITY 


FRANK J. ESTVAN* 
University of Wisconsin 


THIS IS A study of the length of children’s oral 
responses to a series of pictures. It is one phase 
of research dealing with children’s social percep- 
tion in which picture projection was employed. 
Word Productivity is of concern to workers in the 
field of projective testing who are faced with the 
problem of interpreting various measures of pro- 
ductivity. It has also been studied by specialists 
in human development and language arts whose 
findings seem to indicate that urban children are 
more verbal than rural, that girls excel boys in 
language development, and that children with high 
I.Q.'s are more verbally adept than those with 
less intelligence. 

The present study is based on the assumption 
that behavior is caused, and that word count should, 
therefore, reveal something about the individual. 
Acceptance of this assumption would make tenable 
several explanations for the number of words pro- 
duced in response to the series of pictures used 
in this research. One would be that Word Pro- 
ductivity is an index of interest or effort, some - 
times referred to as subject cooperation. Another 
is that Word Productivity is a reflection of the 
meaningfulness of a situation to the individual. 
Third, Word Productivity may be an indicator of 
the pictorial quality or provocativeness of a pic - 
ture irrespective of its content. 

To test these hypotheses, comparisons have 
been made of the Word Productivity of rural and 
urban children, boys and girls, first and sixth 
grade pupils, as well as high 1.0. and low 1.0. 
groups. If differences among these groups can be 
explained in terms of what is already known about 
children or by a new setof constructs, the useful- 
ness of Word Productivity as an aid in the inves- 
tigation of social perception will have been estab- 
lished. 

It must be noted that the verbal behavior being 
examinedis not what an individual knows about 
words, but the number of words he uses to com- 
municate ideas. This distinction is the same as 
that between knowing how well a child can read, 
and what he does read. The meaningfulness of 
the responses elicited in this study will be found 


ЖАП footnotes will be found at end of article. 


in another source. 1 
Source of Data? 


The data analyzed in this report were obtained 
in Part I of the Social Perception Interview. А 
Life-Situation Picture Series consisting of 14 
black and white 2 x 2 inch slides was projected on 
а 30 х 40 inch screen, the directive ог each stim- 
ulus being “What story does this picture tell?” 
The subject’s remarks were tape recorded and 
typewritten verbatim to yield a highly accurate 
record of his free-association responses, 

Of the total of 88 subjects selected at random, 
half were drawn in equal proportions from the 
first and sixth grade children attending one-room 
schools inarural Wisconsin county. The other 44 
subjects were similarly selectedfrom the first and 
sixth grade pupils attending the public elementary 
Schools of a Wisconsin community of approximately 
55,000 population located in a highly urbanized 
region. Sex was equated within each grade level 
for both rural and urban samples, thus providing 
eight subgroups of 11 subjects each. Intelligence 
groups were formed by selecting the two persons 
having the highest Intelligence Quotient in each of 
the eight subgroups to constitute the high-ability 
sample, and using similar procedures at the op- 
posite extreme to form the low-ability group. 

The determination of Word Productivity in- 
volved more than the mechanical counting of a suc- 
cession of words. In many instances, considera- 
tion had to be given to the context in which an ex- 
pression occurred. To standardize procedures, 
the following guides were established: 


A. Counted as one word 

1. Any word contained in a standard English 
dictionary. 

2. Slang expressions or c olloquialisms —cop, 
yeah, sorta’, lotsa’. 

3. Contractions—isn’t, there’s, ain’t. 

4. Compound words—right-handed. 

5. Abbreviations—YMCA, О.К., ABC’s. 

6. Numbers below 100. 
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7. Partial words when the context indicated 
| their use ава complete word—‘‘semi 

(semi-trailer truck), “spose” (suppose), 
“уеуог” (surveyor). 

8. вере ons when they were part of the grow- 
ing perception of the subject—‘‘boy...I see 
а boy” (each word counted), ‘‘and...anda 
кїї.” 


B. Not counted 
1. Sounds—ah, um, mm. 
2. Incompleted words which occurred when the 
subject changed his mind after he had begun 
а word—''sa..."' (sad), “рео...? (people). 
3. Repetitions or stammer ing unrelated to 
thought processes—‘‘and... and... and.. 49 
(latter two notcounted), ‘‘a...a...a.. 


22... 
(latter three not counted) 


In order to eliminate observer inf] uence, no 
attempt was made to estimate the number of words 
missed in those few instanc ез when responses 
were not audible on the tape recorder, Neither 
was the observer required to determine when the 
child had stopped talking about one life- 


situation 
picture and was signalling for the next, such re- 
marks as “that’s all," *O.K.," “1 don’t know 


any more" being incl 


uded in the word count for 
the picture being viewe 


d on the screen. 


Does Word Productivi 


ty Indicate Pupil 
Involvement? 


This is the first and most basic question which 
needs to be answered. If the differences in Word 
Productivity among the pictures in the Series are 
so small as to reflect the operation of chance, a 
lack of personal involvement on the part of the 
Subjects could be inferred. This would tend to in- 
validate the data secured in the picture-story por- 
tion of the Social Perception Interview, and cast 
doubt on all the procedures which followed. 

It is conceivablethat the life-situation pictures 
might be equally stimulating in Content or form of 
presentation to the subjects in this Study. The 
fact that these situations vary in: a) social back- 
ground representation (rural-urban, high status- 
low status, child-adult), b) social functions de- 
picted (10 basic life activities), c) Scope or com- 
plexity of situations (general environment-specif- 
ic social function), and d) child-figure presenta- 
tion (clearly defined or ambiguous) would seem to 
lessen such a possibility. For all these variables 
to average out among the 14 pictures in approxi- 
mately equal word responses is not likely except 
that subjects react in a random manner rather 

ts in each Situation, 

than to the elemen { і 

able I presents the variance in Word Produc- 

Tab 88 subjects to each of the 14 life-sit- 
tivity of the 88 The test for homogeneity of 
uation pictures. | these differences ata very 
variance establishes 


high level of confidence. This means that the Mars 
iability in Word Productivity among the 14 pictur E: 
is too great to consider these responses as 3t 
longing to the same population. In short, the aj 
ferences in Word Productivity among the pictures 
in the series would appear to be real and not 
simply due to chance. The subjects in this study 
were not responding to the Life-Situation Picture 
Series in random fashion. 


Is Word Productivity Related to Certain 


Pupil Abilities? 


In considering the significance of Word Produc- 
tivity, the possibility exists that the total тйс 
of words produced is simply ап index of Lu ^ 
intelligence or a more specific verbal factor. ale 
may be that individuals have developed a gener a 
ized verbal behavior pattern, as far as quantity } 
concerned, which is more influential in determin” 
ing Word Productivity than are the pictures че 
selves. The broader base of general intelligent 
must also be examined in this connection, becaus® 
of the high relationship which usually prevails ра 
tween scores оп intelligence tests and tests 
verbal abilities. кё 

A high correlation between Word Р ro ductivi A 
and intelligence or verbal facility would discou 
the meaningfulness of the Subject’s reactions " 
the Ше situations. The quantitative aspect of hie 
responses could then be attributed to an дер. 
ing “паНуе” ability in general or to amore spec 
ic verbal factor. If no rel ationship were fou 
between these measures, one would have greater 
confidence in regarding differences in Word а 
ductivity as being assoc iated with differences 
the meaningfulness that these situations һауе 
elementary school children. ity 

Measures of intelligence and verbal ы! 
were obtained from the Stanford-Binet Intellige" 
Test, Form L. The Vocabulary score is the пи re 
ber of words for which correct definitions е 
given until six consecutive words were misse" 
These data were available for all 88 subjects i? 
cluded in the study, and m ау be regarded 25 
qualitative aspect of verbal behavior. The ps 
Fluency score 15 the number of words named ' 
‚ is a quantitative meas Ur 
Thelatter is Item 5 for Ye 
grade children and 
children, thus prov 
sis. 

As is indicated in 
ations between W 
Age, b) Intelli 


Table П, попе of the corre: 
ord Productivity and a) мен 
gence Quotient, c) Vocabulary 86019 
апа а) Word Fluency Score are statistically sm 
nificant for the first grade, the sixth grade, 0 

for both groups combined. In eve ry 115 tance 
however, the correlation of Vocabulary Score WI 
Mental Age and Int elligence Quotient is signifi 
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TABLE I 


HOMOGENEITY OF VARIANCE IN WORD PRODUCTIVITY OF TOTAL GROUP 


FOR LIFE-SITUATION PICTURE SERIES 


Picture 

No. Name N Ex rx? s? 

1 Bedroom 88 2619 125839 950. 50 

2 Hovel 88 3747 227797 784. 50 
3 Dam 88 3877 253489 950. 35 
4 Factory 88 3391 206103 867.05 

5 Mansion 88 3989 253121 831.05 

6 Church 88 2906 144502 557.90 
7 Farm 88 3448 203506 786. 29 

8 Swimming Hole 88 3399 211741 924,77 

9 Capitol 88 3296 192746 796. 50 
10 City 88 3438 205728 820. 82 
11 Resort 88 3474 211676 856. 68 
12 Schoolroom 88 3380 217738 1010. 52 
1з ушаде 88 3494 237508 1135. 41 
14 Dock 88 3929 287607 1289. 50 

" — (ZX) 
Fmax= genet 20128000 - 2.34** s? = Х-, 


For К = 14 and п = 87: P(Fmax > 1.80) = .01** 
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TABLE II 


CORRELATION OF WORD PRODUCTIVITY AND MEA 


SURES OF INTELLIGENCE 
AND VERBAL ABILITY 


First Grade 


Sixth Grade 
1 2 3 


1 2 3 
1 
2 .106 .034 
3 . 140 .985** 032 .993жж 
4 . 178 . 741** .739ж%ж -.119 683жж . T00** 
5 . 038 . 254 .325ж .210 100 - 090 
Total Group 
Key: 1 2 3 


1- Word Productivity 

2— Mental Age 

3- Intelligence Quotient . 144 
4— Vocabulary Score 

5— Word Fluency Score: 


«097 .482** 
1N = 24 first grade and 32 Sixth grade .130 .931жж .411%ж 
instead of 44 for each grade as for 
other measures, · 120 . 569** 


-. 217* 


жр<.05 
жжр <.01 
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cant beyond the one percent level of confidence. 
For the total group of 56 subjects who responded 
to the item, the correlation of Word Fluency Score 
with Mental Age and Intelligence Quotient is like- 
wise significantly different from zero. This is 
also true with respect to Word Fluency Score and 
Intelligence Quotient in the First Grade group. 
The fact that Word Productivity is not related 
to level or rate of intelligence as well as to qual- 
itative or quantitative measures of verbal ability, 
would support the conclusion that word responses 
to the Life-Situation Picture Series reflect some- 
thing other than these measures. The Social- 
Perception Interview, from the standpoint of word 
productivity, can make no claim to being another 
form of intelligence test or test of vocabulary. 


Is Word Productivity Related to Character- 
istics of the Instrument? 


Differences in Word Productivity might also 
be explained in terms of certain characteristics 
of the instrument rather than being attributed to 
differences in the meaningfulness of the life situ- 
ations to subjects. It could be that while the sub- 
ject-matter of a picture is interesting to elemen- 
tary-school children, its pictorial treatment has 
little appeal. There 15 also the poss ibility that 
the placement of a picture in the series will effect 
the length of verbal responses elicited. Word 
Productivity of the beginning and latter pictures 
in the series may simply be indications of the need 
for а ‘‘warming-up”’ period or fatigue due to the 
length of the series. 

Pictorial Representation—One way to examine 
the first question is to determine the relationship 
in Word Productivity among the various pictures 
in the series. A significant negative correlation 
between one picture and the others in the series 
would call for extended analysis to discover the 
factors which might explain such deviate respon- 
ses. These could be matters of form or content. 
The intercorrelations are positive and statistical- 
ly significant, thus lending a high degree of con- 
fidence in a relationship that is not zero. Of the 
91 correlations, 30 are +.80 or higher, the high- 
est being between Picture No. 10 (City) and Pic- 
ture No. 13 (Village) which are the most clear- 
cut paired-comparisons inthe series. Except for 
Picture No. 1(Bedroom), each picture correlates 
+.80 or higher with one or more pictures; Picture 
No. 12 (Schoolroom) corr elating with nine other 
pictures at this level of relationship. 

The generally high positive correlation in Word 
Productivity among the 14 life-situation pictures 
indicates that no one picture stands apart from the 
rest of the series because of extremely high or 
low stimulating properties. Although the pictures 
in this series vary in their power to elicit word 
responses from elementary school children, this 
difference is not so extreme in any one picture as 


to question its suitability in content or pictorial 
qualities. 

Picture Sequence— The fact that the first pic- 
ture in the series is the only one which did not 
correlate as high as +. 80 withothers in the series 
is one reason for investigating the relationship be- 
tween Word Productivity and PictureSequence. A 
significant negative correlation could mean that 
subjects gained in confidence and interest and, 
hence, increased in Word Productivity as they 
proceeded through the series—or that the pictures 
toward the end of the series were more provoca- 
tive. А high positive correlation, on the other 
hand, could mean that subjects became fatigued 
or lost interest as they proceeded, that the latter 
pictures in the series were less stimulating, or 
that subjects became more direct and efficient in 
expressing themselves as they gained practice. A 
lack of relationship between Word Productivity and 
Picture Sequence would indicate the absence of 
systematic influences such as “warming up, ” fa- 
tigue, or practice effects, and an evenness in the 
stimulating quality of the life-situation pictures. 

Table IV indicates that for the total group of 88 
subjects there is no relationship between Word 
Productivity and the order in which pictures are 
presented. This is also true for the comparative 
groups listed in Table V with the exception of the 
High Intelligence group. In this case, both rho 
and Kendall’s tau are significantly different from 
zero at the 4 percent level of confidence. If the 
pictures at the end of the series were more stim- 
ulating, we would expectanincrease in Word Pro- 
ductivity from all groups. In the absence of this 
tendency, a more plausible explanation is that 
bright children profit greatly from experience 
and, therefore, reach higher levels of perform- 
ance as they progressthroughtheseries. For el- 
ementary-school children, in general, it is clear 
that differences in the Word Productivity of vari- 
ous pictures are not related to their position in 
the series. 

It is interestingto note in Table V that the rank- 
ings for Picture No. 1 (Bedroom) and Picture No. 
6 (Church) differ by only one point among the eight 
subgroups, and that they rank 14th and 13th re- 
spectively. Whether scenes of a child in bed and 
people going to church seem so obvious to children 
as to need no prolonged comment or whether they 
hreless interesting or meaningful than the other 
pictures can only be surmised, For each of the 
remaining 12 pictures, however, the rankings 
made by the comparative groups differ from 4 to 
9 steps, the average rank-difference being 7. 


Is Word Productivity Related to Differences in 
Social Background and Intelligence? 


Meaningfulness is a function of the kind of ex- 
periences which an individual has, and his ability 
to learn from these experiences. Accordingly, it 
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TABLE IV 


CORRELATION OF WORD PRODUCTIVITY AND PICTURE SEQUENCE 
FOR TOTAL GROUP 


С ___—‏ ی ڪڪ 


Picture НА... NN 

Мо. Total Mean Rank 
ЕР کو ی و ی ی ی چ‎ с... ӨН 

nies 2619 29.76 14 
ПЕТЕР 3747 42,58 4 
Песак 3877 44.06 3 
Ме ETT 3391 38. 53 10 
ТУТ? 3989 45.33 1 
ОРИ 2906 33.02 13 
Peewee 3448 39.18 7 
TELET 3399 38. 62 9 
— M 3296 37. 45 12 
C — 3438 39. 07 8 
Аека анада 3474 39. 48 6 
БИ — € 3380 38.41 11 
[p c — 3494 39. 70 5 
i АИТ 3929 44.65 9 

Total 48387 39. 28 


rho = -. 147 
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TABLE V 


CORRELATION OF WORD PRODUCTIVITY AND PIC TURE SEQUENCE 
FOR COMPARATIVE GROUPS 


Intelligence 


See NN E А кыс БЕ Pe jaan م‎ 

Picture Rural Urban Boy Girl 1st 6th High* Low* 
пр ин 14 14 14 14 14 14 14 13 
ЖІГІ 2 4 4 4 1 8 8 2 
С сон 4 2 2 3 2 3 4 9 

Е а мо 10 10 5 12 8 9 12 3.5 


**For rho 
Ї 


— 2.321 **Kendall’s tau 


2 = — 2.080 

Р = 

**For computing rho, t, and Kendall’ 
(London: Charles Griffin and боон tau see: Maur 


» Ltd., 1 ісе @. Kendall, Rank 


948), Chaps. Тапа Iv. Correlation Methods 


p 
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woul З 
ing соса to expect that children ћау- 
sociat nt social backgrounds suchas those 45- 
ed with rural-urban livi 
(grade) would respond diff ing, sex, and age 
scenes of pon ЗА a ifferently to a variety of 
the reactions of bri ~ е. _For the same reason, 
ш шет шик differ from 
(ена less intelligent. 
„E 17117223 а ape of 
communit [A roduc tivity is related to type of 
made of би ia ae and to sex, analyses were 
1UB, 1UG e four first-grade subgroups (1RB, 1RG, 
Вини ) and another of the four sixth-grade 
Eoi е (6RB, 6ЕО, 6UB, 606). 3 Tests of ho- 
ized in = variance foreach picture, summar- 
in the firm le VI, indicate that with one exception 
groups TI grade, and two in the sixth, the sub- 
ity of th id nor ditte» significantly in the variabil- 
girls шэг Тһе first-gr ade urban 
ance fo ntributed disp roportionately to the vari- 
grade т Picture No. 14 (Dock) and the sixth- 
abilit ural boys acc ounted for the great vari- 
and sd эл responses to Picture No. 4 (Factory) 
three 1 icture No. 8 (Swimming Hole). In all 
han и sex produced greater variance 
lysis а lived in. The results of the алаг 
for the variance for the first-grade subgroups and 
Table 3 sixth-grade subgroups are presented in 
уш УП, and the main effects variance in Table 
Fs Жы попе of the 56 analyses (14 pictures Х 
Sistine x 2 grade groups) was the F ratio sta- 
that th ly significant. This is convincing evidence 
Sex а e locality in which the individual lives and 
Eiken not related to the mean number of words 
Gr A response to the life-situation pictures. 
ro e Differences—A comparison of the Word 
Pe пити of the 44 first-grade subjects and the 
Miche subjects is P 
C Picture No. 1 (Bedr 
duced Mevet, the mean number of words pro- 
that fo y the sixth-grade pupi 
insi: the first. In two cases: а 
епсе M and Picture No. 14 (Dock), this differ- 
Tather ав great enough to be considere 
&rou than due to sampling fluctuations: . word 
a the Mansion sc en e ranked high in = 
and re the sixth grade placing it secon 
Dock 5 first grade ranking it third. e 
it placed ne ranked highest in the si 
It wo тісі seventh in the first grade. 
reali appear that both groups 
ice the significance of living in wealthy Cir- 
Новина Лат difference 
ОЕ sixth grade onthe Mansion picture 1 
ч сау due to the fact that sixth grade children 
erstand a longer period of acculturation. 
Бізде gr why the greatest difere "once to the 
tock sean 8 should appear with 4 
Ure wae ne» it is important to no — 
Wor] aL amed to depict human relations aren 
escale. That sixth gram Cation- 


Wer 
em _ 
ore stimulated to talk about wor 


ships than were first graders, fits in wit 
monly held moron that children md pui 
, "i А 
жегеді to the “Таг” in their development of inter- 
Intelligence Differences—In Tabl 
Productivity of 16 high 1.0. 5 ubje ea 
1.0. pupils аге compared for each of the life-sit- 
uation pictures. In every case, the mean Word 
Productivity of the former group exceeds that of 
the latter. For six of the 14 pictures, the differ- 
ence is statistically sign ificant and not due to 
chance. Thisis considerable evidence to the effect 
that bright children, irrespective of where they 
live, their sex, or gr ade are more stimulated to 
talk about certain life-situation pictures than are 
a comparable group of children who differ in being 
below average in intelligence. 

Why bright children should be more productive 
on the City (No. 10) and Farm (No. 7) pictures in 
the community pattern and to the Hovel (No. 2) and 
Resort (No. 11) in the social status block of pic- 
tures is not immediately apparent. The picture of 
а Dam (No. 3) and that of the Capitol (No. 9), how- 
ever, adult pictures designed 


are two of the three аси 
to contrast age differentials in social experience, 
It is, therefore, 


not difficult to unders tand why 
children of high intelligence would be more stimu- 
lated by scenes depicting the conservation of natu- 
ral resources as in the Dam and of citizenship val- 
ues as reflected in the Capitol building than would 
be the case with less intelligent children. 


Are Extreme Deviations in Word Productivi 
for Bach Individual Related to Differences in 
Intelligence? 


Social Background ant 

In the foregoing ап aly ses, Word Productivity 
has been examined from the point of view that sig- 
nificance is in direct proportion to magnitude. Sig- 
nificant pehavi however, may be expressed by 


avoidance as well as attraction, and either may be 
the same underlying causes. With 
ord productivity, this would be in- 


by an extremely meager word count or by 

oductive response. А determ- 
behavior in these terms would 
nto account the wide range of individ- 
singeneral level of Word Produc- 
d count of 63, for example, might 
high response forone individual, and 
r below average productivity for another 
e requirement that each indi- 
constitute the basis for 


ual difference 
А wor 


i of analysis. 
this type ermine what is a deviant response for an 

поп must be made that word 
rmally distrib- 
le to determine mean Word 
variabili responses for 


ivity and 
produet Ч и the responses beyond +1 SD from 
the oe tals mean Word productivity are des- 


al 
tothe 14 pic 
is then possib 
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TABLE VI 


HOMOGENEITY OF VARIAN 


CE IN WORD PRODUC TIVITY OF FIRST 
AND SIXTH GRADE 


SUBGROUPS FOR EACH PIC TURE 


ل 


1st Grade 6th Grade 
Picture Fmax Fmax 
a 2. 69 2.70 
Bn access 2.17 2.23 
— ка 1.66 1.59 
— 4. 45 10..11** 
раси à. 97 2.93 
—— 1.9 3.06 
-—m 9. 04 1.91 
сина 2.74 9.05** 
_ eddie saos 2. 20 2.65 
IU халааг: 3.19 2,49 
e ЧИНИ 2. 63 2.84 
v — a. 4. 95 2. 64 
il.. ихийг 3. 69 2.60 
> ыс» 7. 32% 


2.18 
For К = 4 and N = 10: 


“Р(Ешах > 5. 67) 
**Р(Е ) 


D—— 


=== 
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ignated as exceptional for him, approximately 32 
percent or an average of 4.4 of his picture re- 


sponses would be included inthe negative and ро5-. 


itive tails of thedistribution. If cut-off points are 
established at +2 SD from the mean, somewhat 
less than 5 percent or . 64 pictures would be sin- 
gled out. (In actual practice, the latter resulted 
in the identification of such unusual responses in 
one of two subject’s responses.) 

Total Group—The number of instances when 
the Word Productivity for a picture was beyond 
+1 SD and +2 SD from each individual’s mean is 
Shown for the total group of 88 subjects in Table 
XI. Although there is no appreciable difference 
between the total number of very limited re- 
sponses and those showing great productivity at 
the +1 SD level, the difference is highly signif- 
icant for the +2 SD scores. This tendency for ex- 
tremely marked variability to be in the direction 
of over- rather than under-production is, there- 
fore, one which may be held with a high degree of 
confidence. Its logic stems from the fact that 
whereas the zero limitation applies to the negative 
end of the distribution, there is no ceiling for re- 
sponses falling in the positive side of the scale. 

At the +1 SD level of variability, five pictures 
show a Statistically significant difference between 
the number of times they evoked little or much 
productivity. In Picture No. 2 (Hovel), Picture 
No. 5 (Mansion), and Picture No. 14 (Dock), the 
direction is positive whereas in Picture No. 1 
(Bedroom) and Picture No. 6 (Church) the direc- 
tion is negative. This is in keeping with the find- 
ings presented іп TableIV showing these pictures 
to rank 4, 1, 2, and 14, 13 respectively in Word 
Productivity. 

Differences between the +2 SD scores for each 
picture are not statistically significant or are too 
small to be tested. It isinteresting to note, how- 
ever, that except for Picture No. 3 (Dam), the 
Bedroom and Church scenes are the only ones for 
which such an extremely limited response was 
forthcoming. The Bedroom situation, however, 
is also among the highest producers at the +2 SD 
level which is not the case for the Church scene. 
It should also be noted that Picture Мо. 12 (School- 
room) is the only one which did not elicit at least 
one voluminous response from the 88 children 
concerned. 

The above findings support the generalization 
that elementary-school children are more stimu- 
lated to talk about conditions of poverty (No. 2) 
and wealth (No. 5) as well as far away places in- 
timated in the Dock scene (No. 14) and to be less 
stimulated by such immediate experiences as 
sleeping (No. 1) and going to church (No. 6). It 
must be remembered, however, that these can be 
held only as te ntative conclusions for it is quite 
possible that another set of illustrations of the 
same life activities might result in different re- 


sponses. 


Comparative Groups— Analyses of the preced- 
ing data in terms of community, sex, grade, and 
intelligence comparisons are given in Table XII 
(+1 SD) and Table XIII (+2 SD). In noinstance is 
the distribution of positive and negative deviations 
for two groups significantly different. Each ex- 
hibits the same general trends as for the total 
group reported in Table XI, or varies so little as 
to be considered a random fluctuation from these 
trends. 

When these same data are compared in terms 
of the social-background patterns on which the 
Life-Situation Picture Series was designed, cer- 
tain statistically significant differences appear in 
all but the rural-urban comparison (Table XIV). 
Girls were more productive on high social-status 
pictures, but boys responded better to low social- 
status situations (Table XV). Whereas first-grade 
children were less productive on adult scenes. 
sixth-grade pupils wereless stimulated by pictures 
centered on child-experiences (Table XVI). The 
high intelligence group was more responsive to the 
adult pictures, and the low intelligence group re- 
sponded more freely to the child-experience pic- 
tures (Table XVII). 

That rural children would differ from urban 
children in their reactions to scenes portraying 
rural-urbandifferentials seems to be a reasonable 
expectation. Why such differences in Word Pro- 
ductivity failed to materialize cannot be accounted 
for at this time. It may bethatthis differential 
does not exist in Southeastern Wisconsin. The 
rural-urban groups being equated in sex and grade, 
no differences were expected in the reactions of 
these groups to the social status and age patterns, 
and none appeared. 

Why boys should talkat such great length to low 
social-status pictures can be explained, in part, 
by the weighting produced by the Swimming Hole 
scene (No. 8) which might be more attractive to 
boys than to girls. The consistently greater re- 
sponse of the girls to the high social-status pic- 
tures is in keeping with developmental data bear- 
ing on the advanced social maturity of girls over 
that of boys. With this maturity could go an in- 
creased sensitivity to the status symbols depicted 
in the Mansion (No. 5) and Resort (No. 11) scenes. 
There being equal representation of rural-urban 
and first- and sixth-grade children in the sex 
groups, there would be no reason for anticipating 
differences in their responsiveness to the com- 
munity block of situations and the age pattern. No 
sex differences were discovered in these two pat- 
terns. 

That differences in grade and, hence, age 
should be reflected in different reactions to the 
child-adult pattern of life-situations can also be 
explained logically. As was indicated in the pre- 
ceding discussion on grade differences in total 
Word Productivity, thisis in keeping with present 
knowledge of the increasing span or breadth of 


52 JOURNAL OF EXPERIMENTAL EDUCATION 


TABLE XI 


PICTURES DEVIATING +1 sp AND +2 SD FROM INDIVIDUAL MEAN 
WORD PRODUCTIVITY FOR TOTAL GROUP 


SD 
Picture E. +1 x2 -2 +2 x? 
С dep acea v 45 12 19. 105*** 5 6 
Bees Wasi Е 10 24 5. 765% 0 2 
~ — HÀ 13 21 1 7 
Be и боље 16 8 0 2 
a 6 29 13. 828жжж 0 3 
ЖҮРЕТІН 22 6 8. 036% 9 1 
Фея 13 13 0 1 
РИ 11 12 0 4 
IN 17 11 0 А 
#0 нека o es 12 11 0 2 
зе и M 10 7 0 1 
Аке киа 16 11 0 0 
И 13 11 0 9 
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26 12. 500жж»ж 0 6 
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view that develops withage. In short, first-grade 
children haven’t yet broadened their interests to 
include more remote situations, whereas sixth- 
grade children no longer find great stimulation in 
а consideration of more immediate experiences. 
The equating of community and sex in the grade 
groups would discount the possibility of differences 
appearing in the community and social-status pat- 
terns, which prediction was borne out by the find- 
ings. 
The responsiveness of bright children to adult 
or remote experiences and that of less intelligent 
pupils to child or immediate experiences can like- 
wise be interpreted from the standpoint of the re- 
lationship between mental development and the 
growth of interests and concerns. These groups 
being equated in community, sex, and grade var- 
iables, there would be no basis for predicting dif- 
ferences with respect to the other patterns of life- 
situations, and none were found. 


Is Word Productivity Related to Picture 
Preference? 


Several hypotheses can be advanced regarding 
the verbal output of children on pictures which 
they like best or like least. One would be that 
children have a great deal to say about what 
pleases them, and very little about situations 
which they dislike. Itcould be, too, that children 
take for granted the desirability of their prefer- 
ences, but find it necessary to substantiate their 
rejections with extended statements. On the other 
hand, it may be that it requires as m uch verbal 
activity to defend choices as it does to substanti- 


ate rejections. 
Data for the examination of this question were 


provided in Part Ш of the Social Perception Inter- 
view. The subject was asked to designate “Ше 
picture you wouldlike to keep for your very own" 
and to tell why. He was given a сору of this pic- 
ture and requested to select a second, and then a 
third in similar fashion. Finally, he was asked 
to point out “ше picture you wouldn't like to 
keep, ” and to tell why. 

- Productivity for the first, second, third, 
and least preferred and the 10 pictures which were 
not mentioned is givenfor the total group ІП Table 
XVIII. and for the comparative groups in Table 
XIX. | The skewness of each distribution toward 
the lower end of the scale in the former table 
corroborates the finding of the preceding section 
that deviant responses are more likely to be in 
the direction of greater productivity rather than 


aE in mean Word Productivity for 
the various designations of likes and dislikes as 
well as for the С o m bined preferences and non- 
selected pictures are given in Table XX.. The only 

stically significant differences 


instances of stati | 
occur in the comparison of the Intelligence groups. 


The difference between the productivity of the 
third choice and least preferred picture is signif- 
icant at the 5 percent level, and that between the 
second and least preferred 15 significant at the 1 
percent level of confidence. In both cases, the 
productivity of the rejected picture is higher than 
that of the preference pictures. 

The evidence indicates that, generally speaking, 
word output in the free-association responses 
(Part I) is no predictor of picture preferences in 
Part Ш of the Social-Perception Interview. It 
would seem that whatever the causative factors 
operating in Part I, they аге notidentical to those 
influencing picture-selection behavior in Part ш. 
It may be that telling a story about a situation 
draws upon the individual’s background of experi- 
ence whereas, in the selection of pictures, values 
and interests are brought into focus. The differ- 
ence is between what the subject knows and is fa- 
miliar with (Part I) and what he would like to have 
or hopes to be (Part Ш). Whether background or 
aspiration is involved, the same number of words 
is required for the expression of one as for the 
other. 

Although this explanation also applies to the 
Low Intelligence group, it does not explain why 
exceptions should occur in the relationship be- 
tween second and third choices with that of the 
least preferred picture. One explanation is that 
less intelligent children have more limited goals 
than bright children, and that once having select- 
ed the ‘‘best’’ picture they were less able or less 
motivated to discriminate among the remaining 
pictures for second and third choices. 


Conclusions 


A number of important conclusions may be 
reached regarding the free-association responses 
to the Life-Situation Picture Series. First, Word 
Productivity is an index of behavior. Pupiis did 
not respond to the pictures in a random fashion; 
neither were word responses a reflection of cer- 
tain innate pupil abilities or characteristics of the 
instrument. Second, there isa relationship be- 
tween children's social background andtheir Word 
Productivity on specific life-situation pictures or 
to patterns of these situations. These differences 
were related to intelligence, grade, and sex in de- 
scending order, there being none associated with 
the rural and urban populations sampled in this 
study. Third, the behaviors elicited in the free- 
association and picture-preference phases of the 
Social-PerceptionInterview are different. Where- 
as an analysis of Word Productivity in terms of 
children's social background did reveal significant 
relationships, this was not the case for the anal- 
ysis in terms of picturechoices. This is in keep- 
ing with the design for the Social-Perception In- 
terview which was planned to bring about increas- 
ing involvement on the part of subjects. 
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TABLE XVII 


WORD PRODUCTIVITY OF PICTURE PREFERENCES FOR TOTAL GROUP 


Total 
Non- 
Preferred Selected 
Words 1st 2nd 3rd Least (Ist, 2nd, 3rd) (М-10) 
210-219..... 5 
200-209..... 4 
190-199..... 5 
180-189..... 1 1 0 
170-179..... 0 0 0 
160-169..... 0 0 4 
150-159..... 0 1 1 2 1 
140-149..... 0 0 0 1 0 2 
130-139..... 0 0 0 1 0 4 
120-129..... 0 1 0 0 1 8 
110-119..... 0 1 3 + 4 11 
100-109..... 2 5 3 2 10 10 
90- 99..... 2 0 3 2 5 20 
80- 89..... 2 1 1 1 4 25 
10- 19... 5 1 2 6 8 26 
60- 69..... 6 5 4 2 15 49 
50- БО ас, Ч 9 4 8 20 66 
40- 49..... 6 12 12 12 30 100 
305-1891... 12 14 11 19 37 137 
20- 29..... 18 15 20 10 53 183 
10- 19 РУР 20 14 15 16 49 169 
Uc б» 7 9 8 6 24 65 
. шэг св 5n 39.72 39. 56 
3.22 3.35 -4 13 "е ve 
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TABLE XIX 


WORD PRODUCTIVITY OF PICTURE PREFERENCES FOR COMPARATIVE GROUPS 


1 
Picture Preference Total 
Preferred Non- 
Comparison Group | Statistic 1st 2nd 3rd Least (1, 2, 3) Selected 
N 44 44 43 44 131 441 
— Mean 37.73 39.77 35.93 40.45 37. 82 38. 38 
50 34.19 34.67 31.68 30.69 33.21 31.53 
r SEM 5.16 5.23 4.83 4.63 ж ж 
Community N 44 44 44 43 132 441 
— Mean 39.09 43.41 44.55 40.58 41.59 40.74 
5р 24.62 24.97 30.57 28.23 26.77 27.41 
ЗЕМ 3.11 3.72 4.61 4.30 * * 
| N 44 44 44 44 132 440 
a Mean 41.14 40.91 40.23 39.77 40.76 39.48 
y SD 32.15 28.47 33.38 28.08 31.05 30. 06 
4. 85 4.29 5.03 4.23 * * 
ын 44 44 43 43 131 442 
35.68 40.00 40.35 43.61 38. 66 39. 64 
26.98 32.17 29.31 30.28 29. 30 29, 07 
4.07 4,85 4.41 4.62 * * 
N 44 44 43 43 131 442 
— Mean 34.11 34.77 38.72 41.05 36. 07 36.15 
ane sD 26.53 27.32 35.32 34.24 29.74 30.06 
SEM 4.00 4.12 5.39 5.22 * * 
Grade N 44 44 44 44 132 440 
: Меап 42.05 46.14 41.82 42.27 43. 33 42.98 
Sixth SD 32.10 32.15 27.00 23.36 30. 23 28. 66 
SEM 4,48 4.85 4.01 3.52 * * 
N 16 16 16 16 48 160 
! Mean 51.25 55.13 55.44 48.56 54. 38 51.38 
High** SD 29.81 38.48 34.52 31.17 33. 61 30. 50 
7.45 9. 62 8. 63 7.79 * * 
| Intelligence N 16 16 16 15 48 161 
Mean 35.75 25.38 29.06 35.53 30. 42 31.96 
Low** sD 43.01 24.71 26.89 28.82 32.02 33.52 
SEM 10. 76 6.18 6. 72 7.44 | * * 
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4 and Total Non-selected pictures, ЗЕМ was 


he difference in N between Total Preferre 
tely, but combined in formula: 


ЕЕ па (5 + А.) 


* Because of | 
not computed separa 


ОХ,-Ха 7 № + № - 2 №, N2 


жж = 16 subjects instead of 44 as for other groups. 
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In short, Word Productivity provides one 
source of validation fortheinstrument. Gross as 
this measure of human behavior may be, it reveals 
certain significant relationships which, with few 
exceptions, can be explained in terms of what is 
known about children. Equally important, ап апа1- 
ysis of word count, with the exceptions noted, has 
failed to produce relationships whe ге none could 
reasonably be expected. 

A number of generalizations also pertain to the 
nature of Word Productivity itself. From the out- 
set, it was established that there is a difference 
between word knowledge, as commonly deter- 
mined by vocabulary tests, and the quantitative 
aspect of verbal behavior in spontaneous oral ex- 
pression. That an individual has a better grasp 
of word meanings is no indicator of how many 
words he will use in expressing himself. Also, 
it is clear, that broad generalizations about su- 
periority inlanguage facility are over-simplifica- 
tions. No one group of children showed consis- 
tently greater response or lack of response to the 
Life-Situation Picture Series. Word Productivity 
depends on what is being talked about. Studies of 
verbal facility will p roduce different results de- 
pending upon whether subjects are free to select 
topics for discussion or are required to deal with 
prescribed situations. 

Two cautions must be pointed out for the inter- 
pretation of the above findings. First, there is 
considerable indeterminancy regarding Word Pro- 
ductivity when analyzedfrom the standpoint of de- 
viations from each individual’s mean wordre- 
sponse. Because of limited s ampling groups 
many frequencies were zero or too small to be 
treated statistically. То clarify these indeter- 
minate areas, quite obviously, will require larg- 


er sampling procedures. 


the analyses revealed 


The fact that none of 
the question of the 


rural-urban differences raises 


applicability of these findings. Although the county 
being sampled has only five communities, none of 
which are over 15, 000 population, and all the sub- 
jects were farm children, does this constitute a 
rural sample? In similar terms, although the 
urban community is located in the second largest 
urban region in this country, do these children 
represent urban children? Could it be that chil- 
dren living in more isolated rural regions would 
differ from those living in a metropolitan com- 
munity of several million population? It may be 
that to compare rural and urban children in gen- 
eral is another instance of over-simplification, 
and is one explanation for the lack of agreement 
found in studies of these culture groups. 


FOOTNOTES 


* For assistance with statistical computations, 
the writer wishes to acknowledge his indebted- 
ness to the following research assistants: Irvin 
J. Lehmann, Sobhi T. Geraissa, Donald W. 
Hinkkanen, and Mohamed K. Hindy, as well as 
to R. James Evey, Project Supervisor of the 
Numerical Analysis Laboratory, University of 


Wisconsin. 
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THE RELATIONSHIP BET W EEN HANDWRIT. 
ING PRESSURE AND LEGIBILITY OF HAND- 
WRITING IN CHILDREN AND 
ADOLESCENTS* 


THEODORE L. HARRIS and G. LAWRENCE RARICK 
University of Wisconsin 


Statement of the Problem 


THE PURPOSEof this investigation was to ex- 
amine the relationship between handwriting legi- 
bility and point pressure inthe handwriting of boys 
and girls in grades 4, 6, and 10. Previous re- 
search at the college level had indicated that high 
legibility tends to be accom panied by relatively 
low variability in point force, while low legibility 
tends to be accompanied by relatively high varia- 
bility in point force. This study was specifically 
designed to examine the relationship between these 
two factors at younger age levels. 


The Background of the Investigation 


The present researchis one of several studies 
which are being conducted at the University of 
Wisconsin by a Committee for Research in Hand- 
writing. 1 The Committee works asa unit in plan- 
ning the research program. In addition, specific 
aspects of research are delegated to individual 
members of the group. Aspects of handwriting 
currently being studied by the Committee include 
studies of the characteristics of the handwriting 
erms of legibility and its ass ociated 
characteristics of effective handwrit- 
the factors influencing choice of 
handwriting instruments, motor components in- 
volved in the handwriting process, and perceptual 
factors involved in the production and evaluation 


of handwriting. 
The present research is a 


ман ил 


product in + 
factors, the 
ing instruments, 


direct extension of 


* Funds for the support о 


f this study were provided in 
Wisconsin, 


an investigation of the relationship between hand- 
writing pressure and legibility at the college level, 
originally reported in 1955 as a bulletin of the 
School of Education of the University of Wisconsin, 
and subsequently reprinted in the Journal of Ex- 
perimental Education. 2 This study should be con- 
sulted for a discussion of previous research con- 
cerning handwriting pressure and for a presenta- 
tion of the technical details of the instrumentation 
used and the type of records obtained. 

The experimental equipment employed in the 
present investigation was the same as that used in 
the college study. Essentially, the equipment in- 
cluded a specially constructed writing table with a 
metal platen mountedflush with an adjustable sec- 
tion of the table top (see Figure 1). The platen 
rested on a strain gauge system which made pos- 
sible the detection of minute changes in point 
pressure during writing. The signal from the strain 
gauge system was amplifiedby a vacuum tube ат- 
plifier and photographically recorded by a Hatha- 
way recording oscillograph. 

It became clear at an early stage in the analy- 
sis of the oscillographic records that two or more 
records having the same average word or sentence 
force might differ markedly in the constancy with 
which theforce was applied to the page in writing. 
It was felt, further, that if the amount of variation 
in force could be measured in such a way as to 
make comparisons between records of varying 
speeds and levels of average force, a new way 
might be found to explore the relationship between 
handwriting force and legibility. 
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A step toward the solution of this problem was 
made in the development of the force variation 
ratio, the derivation of which is described in the 
college study. 3 This technique of analysis makes 
it possible to meaure the cumulative amount of 
change in the application of force for each word, 
and to derive a ratio between the sum of the devi- 
ations in force from the base line of the oscillo- 
graphic record and the time taken to produce the 
record. The formula for the force variation ratio 


1S: 
FVR- 


Change in mm. -2 (average height of curve) 
Length of base line (time) 


This ratio was used as a major technique in the 
analysis of the oscillographic pressure records 
in the college study. 

Findings of the College Study— Two types of 
findings in the original study will be mentioned 
briefly here, one relating to data gathered con- 
cerning force variation ratio, and the other to the 
relationship found between this ratio and legibility. 

One of the concerns of the original study was 
to investigate the significance and stability of the 
force variation ratio. Accordingly, a test of fine 
motor coordination was devised; namely, to draw 
four one-inch horizontal lines as steadily as possi- 
ble. The point force was recorded and analyzed. 
The stability of this test was indicated by the rel- 
atively high correlation of .88 between the sum of 
the force variation ratio for the first and fourth 
and for the second and third samples, respective- 
ly, for 19 college subjects. The stability of the 
pressure pattern in tasks whichare similar in na- 
ture is shown by the fact that a correlation of .81 
was secured between the force variation ratios 
for the short line test and those of the normal 
handwriting samples for the sam e group of sub- 
jects. This suggests that the force variation ra- 
tio provides a measure of fine motor control in 


handwriting. 
The relationships among ! t 
college investigation as indicated by correlation 


analysis are reported in Table 1. Note that the 
most substantial correl ations i 
variation ratio. Average pressure, on the other 
hand, shows little relationship to any other vari- 
able except variation in force. These fi ndings 
markedly influenced the decision to study further 
the relationship between the force variation ratio 
and legibility in a population of children and ado- 


lescents. 
Methodology of the Current Study 


nd Design— The study was designed 
e technique might be 


g factors studied in the 


Population а. à 
so that an analysis of varianc 


3. Ibid., p. 162. 


used in the treatment of the data. The subjects 
were 144 boys and girls randomly drawn from the 
schools of two small urban communities in south- 
ern Wisconsin in such a way that 12 boys and 12 
girls were secured at eachof three grade levels— 
grades 4, 6, and 10--іп еасһ community. It Should 
be pointed out thatthe population was further lim- 
ited to those children who were right-handed, who 
had had continuous experience in the school sys- 
tems concerned, and who were within the normal 
range of age expectancy for their grade. Mean 
chronological ages of the subjects by sex and grade 
аге given іп Table II. In those cases in which 
more than one school in agiven community was in- 
volved, cases were drawn on a proportional basis 
according to the number of children per grade per 
school. It was felt that this procedure in the se- 
lection of the experimental population would make 
possible a more systematic approach to the study 
of the relationship betweenthe force variation ra- 
tio and legibility as it might be affected by the fac- 
tors of sex and community. 

The Experimental Task— The basic data for 
this study were gathered during the fall and winter 
of the 1955-56 academic year. Since a relatively 
large population of children was included in the 
study it seemed best to move the experimental 
equipment to the two communities in which the 
data were to be gathered. Thecooperating schools 
provided a room for the experimental equipment 
and also aided in securing an experienced teacher 
to assist in helping the children adjust to the ex- 
perimental situation. 

Each child was tested individually, all meas- 
ures being secured on a particular child during 
one testing period. Uponenteringthe testing room 
the child was seated at the handwriting table and 
necessary adjustments in chair height and writing 
angle were made. Each child was asked to write 
his name on a card. The child was then shown a 
card with the typewritten sentence, ‘‘The quick 
brown fox jumps over the lazy dog." He was asked 
to study the sentence until he could write it from 
memory. Each pupil was given an opportunity to 
write the standardsentence several times until he 
learned to spell each word and to reproduce the en- 
tire sentence correctly. Next, afresh card was 
placed over the platenin preparation for recording 
point pressure in writing the standard sentence. 

After determining that the pupil could repeat the 
standard sentence from memory, he was then 
asked to write the sentenceonthecard. Care was 
taken to instruct each child to write in his usual 
way. Atthis time his handwriting pressure Was 
recorded. After completing the reproduction of 
the standard handwriting sample, a second card 
was placed over the platen (see Figure 2). The 
subject was asked to complete the figure with 
either curved or straight lines as indicated on the 
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TABLE I 


RANK-ORDER CORRELATIONS BETWEEN LEGIBILITY TIME, 
AVERAGE PRESSURE, AND VARIATION IN FORCE FOR 
57 NORMAL, SLOW, AND FAST ADULT SAMPLES 


Variables Rho 
Variation in force and pressure . 69 
Variation in force and time . 67 
Variation in force and legibility . 59 
Legibility and time .54 
Pressure and time ‚22 
Pressure and legibility 01 


pRanked from low to high 


Ranked from low to high 
Ranked from Slowest to fastest 
Ranked from best to poorest 


c 


TABLE II 


MEAN CHRONOLOGICAL AGES OF SUBJECTS ву GRADE AND SEX 


ПИ i me ERN 


Girls 
Mean 
M 
Grade N (Months) S. D. N (Months) S. D. 
Grade 4 24 116.38 4.80 24 115.75 4.38 
Grade 6 24 143.88 9.81 24 143.50 11.72 
Grade 10 24 185.13 7.96 24 


186. 08 5. 63 
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FIGURE 2: FORMS TO BE COMPLETED IN THE 
TEST OF FINE MOTOR CONTROL 


card. This test was designed to measure the pu- 
pil's finer motor control in a situation similar to 
that of handwriting. 

Methods of Analysis of Data—In the present 
study, force variation ratios were calculated by 
the techniques used in the college study. How- 
ever, the calculation of the force variation ratio 
for an entire sentence is a time-consuming, la- 
porious job. Accordingly, a study was made of 
the consistency of the force variation ratio for 
adults and for a randomized sample of subjects in 
grades 4,6, and 10 by correlating the force vari- 
ation ratios for each word with the average force 
variation ratio fortheentiresentence. These da- 
ta are presented in Table Ш. The consistently 
high correlation between the force variation ratios 
for individual words and the average force varia- 
tion ratio for the entire sentence may be noted. 
Since the force variation ratio for the words “Чох” 
and ‘‘jumps’’ showed consistently the highest cor- 
relations, the sum of the force variation ratios 
for the phrase ‘‘fox jumps” only was used. In 
other words, the children wrote the complete sen- 
tence but an analysis of the pressure records of 
these two words only was made. As might be in- 
ferred from the data in Table Ш, the time re- 
quired to write the phrase ‘‘fox jumps" correlated 
highly with the time required to write the en- 
tire sentence. Table IV demonstrates that the 
time for writing a short phrase is representative 
of the tempo of writing for the entire sentence. 

Evaluations of the legibility of the handwriting 
samples produced by children and adolescents in 
this study were made by rating the legibility of the 
total sentence on an eleven point experimental 
scale composed of reproductions of the standard 
sentence, ‘‘The quick brown fox jumps over the 
lazy dog.” This experimental scale was а com- 
posite scale made up of samples drawn from pre- 
viously developed handwriting legibility scales 
prepared for grades 4, 6, and 10, respectively. 
The legibility ratings were done by five exper- 
ienced judges who followed standard procedures 


in determining eachchild’s legibility rating. Each 
rater displayed a relatively high degree of consis- 
еу in his own ratings. This is shown in Table 

A generally high level of consistency among 
judges was also established. Median scale values 
for the ratings of the five judges were accepted as 
the best estimate of the legibility of the sample. In 
122 cases out of 144 samples, the following сгі- 
teria were met: 1) four or five ofthe ratings were 
identical with or contiguous to the median value 
(in 116 cases), and 2) acentral tendency was 
clearly indicated (in six cases). Тһе гета ти 
22 samples representing split distributions were 
then considered individually. For example, if one 
sample varied markedly from the majority of the 
ratings, it was re-rated. If the new rating did not 
move toward the medianvalue, the atypical rating 
was discarded and the most typical retained. 

Comparisons with the College Study—A sum- 
mary of certain aspects of the methodology of the 
handwriting study of children and adolescents and 
that of the college study is given in Table VI. 

It will be noted that the current study does not 
constitute a direct replication of the college study. 
In addition to obvious differences in the size and 
level of the population, other important specific 
differences should be noted: 


1. The current study represents a randomized 
sample within purposely controlled factors 
of community, grade, andsex, while the col- 
lege sample was stratified to identify the 
best and poorest writers in a given college 
population. 


2. A single sample of handwriting was utilized 
in the current study, whereas subjects wrote 
anormal, a slow, and a fast sample in the 
college study. 


3. In the current study the handwriting samples 
were rated on an 11-pointscale, while in the 
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TABLE Ш 


CORRELATIONS OF THE AVERAGE FORCE VARIATION RATIO FOR THE EN- 
TIRE SENTENCE WITH THE FORCE VARIATION RATIOS OF INDIVIDUAL 
WORDS FOR 30 SUBJECTS RANDOMLY DRAWN FROM GRADES 4, 6, 
AND 10 AND FOR 19 ADULT COLLEGE SUBJECTS 


4th Grade 6th Grade 10th Grade Adults 
Word (N=10) (N=10) (N=10) (N=19) 
quick ‚856 .818 .723 . 933 
brown . 809 . 824 . 847 . 919 
fox 924 . 884 „918 . 943 
jumps . 908 . 897 . 944 . 945 
оуег . 883 863 914 . 924 
the . 780 719 957 . 940 
lazy . 819 827 . 800 890 
dog . 914 . 804 «9217 . 956 


ЛОИ‏ هت 


TABLE IV 
CORRELATIONS BETWEEN WRITI 


NG TIME FOR THE PHRASE < 1 
AND THE TOTAL SENTENCE w RASE “РОХ JUMPS” 


RITING TIME FOR 30 SUBJECTS RAN- 
DOMLY DRAWN FROM GRADES 4, 6, AND 10 


N г 
ы 10 . 963 
10 10 
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TABLE V 


CONSISTENCY CHECKS ON HANDWRITING LEGIBILITY RATINGS OF 
FOUR JUDGES* BASED ON 12 SAMPLES RANDOMLY DRAWN 


FROM A TOTAL OF 144SAMPLES 


Rater 1 
Rater 2 
Rater 3 


Rater 4 


Deviation in Scale Steps from Original Rating 


None 1 
6 4 

11 1 

8 1. 

7 1 


2 


4 


3 


0 


______________________Н86-——= ла == -е 


*The 5th judge was unavailable when the consistency check was made. 


TABLE УГ 


A COMPARISON OF THE METHODOLOGY USED IN THE HANDWRITING STUDY OF CHILDREN 
AND ADOLESCENTS AND THAT OF THE COLLEGE STUDY 


—-——————————————————' 
i 


Current Study of Children 


and Adolescents 


Previous Study of 
College Students 


Description 
of 


Sample 


Number and type 
of handwriting 
samples 


a. 


144 subjects in grades 
4, 6, and 10 


72 boys and 72 girls 


A randomized sample of 
12 boys and 12 girls at 
each of three grade lev- 
els in two communities 


One sample (normal) 
per subject 


Method of 
rating 
legibility 


Rating of normal samples on 
ап 11-point composite scale. 
The scale was made up of 
samples drawn from Grades 
4, 6, and 10. 


19 subjects in junior 
year of college 


9 men and 10 women 


A non-random sample 
of the best and poorest 
writers in a university 
class of 231 students 


Three samples (normal, 
slow, and fast) per 
subject 


Single rank-ordering of 
57 samples (three sam- 
ples per individual) of 
normal, slow, and fast 
writing 
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college study samples were placed in rank- 
order of legibility. In both cases а satis - 
factory degree of consistency was obtained. 


Differences in the methodology of the two stud- 
ies produced certain differences in the data and 
likewise placed certainlimitations upon compari- 
sons of the data in the two studies. It seems 
clear that the random sampling technique in the 
current study tended to maximize the chances of 
amore normal distribution of handwriting legi- 
bility than was obtained in the college study. The 
use of a single sample of handwriting produced at 
anormal rateinthecurrent study probably result- 
ed in less variance in legibility than would have 
occurred had the samples been written at differ- 
ent speeds as in the college study. 

Since the studies likewi 
the method of Securing ra 
using a handwriting sc 
ordering technique, 
er re-evaluated for 


Results of the Study 


While the major e mphasis of the present in- 
vestigation was upon the determination of the re- 
lationship between the two Ї 


point pressure and handwri 


ch the rate iti 
was systematically varied, айын 
Initial Steps in Treati 


constituted th 4 
sources of data on the 144 children е primary 


communities. An analysis of vari 
employed to test the hypoth 
sex, or community differ 
legibility or variability i 

The legibility variable was Selected as the ini- 
tial item for analysis. In Selecting а level of соп- 
fidence which seemed appropriate to the data and 


SS ee 


ance de 
€Ses that no tr 


ences existed f 
п point pressure, 


Sign was 
ue grade, 
ог either 


4. David В. Duncan. ‘‘Multiple Range апа Multiple ғ Tests," 
> 


to the measuring techniques employed, the inves- 
tigators set the one percent level. The results of 
the analysis of variance are given in Table VII. It 
will be noted that the F ratios of 43.18 and 19. 08 
for sex and grade respectively are significant be- 
yond the .01 level, whereas no community differ- 
ences in legibility were obtained. Hence, the null 
hypotheses were rejected for sex and grade, and 
the legibility data for the two communities were 
combined to determine what grade and sex differ- 
ences in legibility did exist. 

Sex and Grade Differences in Hand 
ibility—Grade and sex trends in legibility includ- 
ing those for the college sample are shown in Fig- 
ure 3. It should be noted that the slope of the leg- 
ibility curve for the girls shows substantial im- 
provement in legibility from grade 4 through grade 
10, whereas such is the case for the boys. In fact 
neither the 10th grade boys nor the sample of col- 
lege males achieved the mean legibility standard 
of the 6th grade girls, 

Means and standard devi 
legibility are given in Table 
point scale. While the mean 
регіог to those of the boys а 
Standard deviations are suf 
sure considerable оу 
at grades 4 and 6. 


writing Leg- 


ations for handwriting 
УШ based on the 11- 
5 of the girls are su- 


4 through 10. 
The results of this 


! test showed that there were 
no Significant differen i 


Biometrics, Хї (March 1955), pp. 1-9, 


an 
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VALUE 


SCALE 


LEGIBILITY 


4 6 8 10 12 14 16 
GRADE 


FIGURE 3: MEAN LEGIBILITY RATING 
BY SEX AND GRADE 


13 
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TABLE VII 


SUMMARY OF ANALYSIS OF VARIANCE FOR HANDWRITING LEGIBILITY 


Sum of Mean 
Variation Squares dt Square Е 
5 115. 5624 1 115. 5624 43. 1831% 
ех 
Community . 1736 1 «1138 
Grade 102. 0972 2 51.0486 19.0757ж 
5хС 1.5625 1 1.5625 
SXG 63. 2914 2 31. 6457 11. 8253* 
CxG 3. 3472 2 1. 6736 
SxCxG 5. 5418 2 2.7709 1.0354 
Within Groups 353. 2503 (12)(11)=132 2. 6761 
Total 644. 8264 


143 
* Significant beyond the 1 percent level 


TABLE УШ 


TING LEGIBILITy 
SCORES BY GRADE AND SEX 


Grade 
Level 


Sex Mean SD 
j Boys 7.17 „90 

Girls 6.83 1.73 
қ em 6.71 1.46 

Girls 5.21 2:10 
а 6.71 1.17 

Girls 3.17 Tes 
12 Men 6.22 

Women 3.10 2.44 


| И 
: HARRISON - RARICK 15 
1 
Я. 
| 
| | TABLE IX 
и || 
| | SUMMARY OF ANALYSIS OF VARIANCE FOR FORCE VARIATION RATIO DATA 
a Source of Sum of Mean 
Variation Squares df Square F 
Sex 23, 2900 1 23.2900 4. 6939* 
Community 23. 3900 т 23.3900 4.7140% 
Grade 147. 0966 2 73. 5483 14. 8229** 
| ` 5хС 14.8225 Т 14.8225 2.9873 
1 5ха 2.6689 2 1.3345 
сха 8.6123 2 4.3062 . 8679 
| вхсха 3. 9631 2 1.9816 
| Within Groups 655. 9510 (12)(11)=132 4. 9618 
| Total 878. 7944 143 


зе ^^ = د‎ Bs ي‎ el 
| * Significant at 5 percent level 
**Significant at 1 percent level 


1 TABLE X 
MEANS OF FORCE VARIATION RATIO BY GRADE AND SEX 
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unity differences were significant at the five 
Percent Пан, So that the analysis of the data 
might parallel that for the legibility data, the re- 
sults for the two com munities were combined in 
further examining differences in sex and grade. 

The mean force variation ratios by grade and 
sex are presented in Figure 4. It will be noted 
that with advancing grade level, the force varia- 
tion ratio increases for both sexes through grade 
10. The upward trend continues for college wo- 
men. 

Group data for the force variatio 
given in Table X. Although the co 
were not included inthe analysis of у: 
scores for these sub 
Duncan Range Test was applied to the force vari- 


п ratios are 
llege scores 


improved hand writing le 
expected a decline rather 
force variation ratio. 


el, however, this trend is reversed, the women 
producing highly legible handwriting with low var- 


On the other 


section of this report. 

Grade and Sex Trends in Writing Time qt is 
а common observation that young children write 
more slowly than older chil 


1 Чгеп, although there 
is considerable variability in writing rates among 
individuals of two given ages. The mean time for 


writing the standard sentence is plotted by grade 
and sex in Figure 5. It will be noted that the sex 
differences in rate of writing were not great al- 
though the mean writing time was less at зиссез_ 
sive grade levels. In fact the rate of Writing 
more than doubled between grades 4 and 10. The 
most rapid gains in writing speed occurreq be- 
tween grades 4 and 6, with slightly less ехаррег- 
ated gains between grades 6 and 10 and only а 
relatively slight increase in writi Ng Speed be. 


tween grades 10 and 15. 

Mean values for writing time by grade and sex 
are shown in Table XI. These data show that the 
mean writing time declined at a relatively constant 
rate with advancing grade level for both sexes. In 
general, the standard deviations indicate wide var- 
iability in writing times ехсер ог males in grade 
10 and in the college sample. The Duncan Range 
Test was appliedtothe data for mean writing time 
for grades 4, 6 and 10. No significant sex differ- 
ences within each of grades 4, 6 and 10 were found 
for mean writing time. However, the perform- 
ance of fourth grade boys was significantly differ- 
ent that that of sixth grade boys (р = .05), sixth 
grade girls (p- .01), and tenth grade boys and 
girls (р - .01). The performance of fourth grade 
girls was also significantly different from that of 
sixth grade girls (p=. 05), and from that of tenth 
grade boys and girls (р=. 01). The mean writing 
time of the sixth grade boys was also significantly 
different from that of tenth grade boys (p=, 05). 
These data clearly indicate that a substantial in- 
crease in speed of handwriting may be expected at 


Successive grade levels, and that speed in hand- 


writing appears to be a function of age rather than 
of sex. 


are not dir point pressure 
байна. ectly comparable and are therefore 
Table хп shows th 


+05) were found for both 
een Бтайев 4апа 10 Only. Thus it would 
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TABLE ХІ 


AVERAGE TIME REQUIRED TO PRODUCE THE STANDARD SENTENCE 
BY GRADE AND SEX 


Grade Mean Time 
Level Sex N (in seconds) 
4 Boys 24 119.53 

Girls 24 102. 62 
6 Boys 24 19.92 
Girls 24 65. 64 
10 Boys 24 42.02 
Girls 24 46.85 
College Men 9 32.12 
Women 10 39. 47 


TABLE XII 


MEANS FOR POINT PRESSURE DATA BY GRADE AND SEX 


Girls 


Grade Mean Pressure 
4 Boys m 43.72 26.16 
Girls 24 41.69 5096 
в врза 24 30. 80 10. 61 
ста 24 29. 89 20. 25 
10 нота 24 24. 46 7. 94 
у 24 24.88 1.29 
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and legibility, Pearson product-moment correla- 
tions were run between these two variables at the 
4th, 6th, and 10th grade levels. These correla- 
tions are given in Table XIII. In order to inter- 
pret the apparent discrepancy between the data 
for the college population andthe data for the pres- 
ent study, thecorrelations between the force var- 
iation ratio and legibility of the college population 
are also included. 

It will be noted that all correlations between 
the force variation ratio and legibility ratings of 
the scaled samples for the 4th, 6th, and 10th 
grade pupils are low. It is evident, therefore, 
that under the conditions of this investigation, leg- 
ibility and the force variation ratio appear to be 
unrelated. On the other hand, all correlations 
for the college subjects are positive and at least 
four are moderately high. Thus, whenall57 sam- 
ples (slow, normal, and fast) for the 19 college 
subjects were correlated with the force variation 
ratio, moderately high correlations were obtained. 
The inclusion of the three samples at different 
speeds for each subject tended to broaden the 
range oflegibility as well as that of the force var- 
iation ratio and hence tended to maximize the de- 
gree of correlation. On the other hand, when the 
analysis was restricted to only 19 normal sam- 
ples, the effect of writing speed was minimized. 
Hence, the range of scores for legibility and for 
the force variation ratio was markedly reduced. 
This may, in part, explain the low correlations 
between these variables. It should also be noted 
that in those instances in which the samples were 
rated against the handwriting scales, lower coef- 
ficients of correlation were obtained than when the 
samples were ranked from most legible to least 
legible. In the present study, handwriting scales 
were used for alllegibility rankings. It is be- 
lieved that the use of scales tended to group the 
Scores near the center of the distribution and 
yielded lower correlations than would have been 
found had the samples been placed in a single rank- 
order. This is illustrated by the low correlation 
of . 18 found between the force variation ratio and 
legibility when the normal samples only for the 
college population were rated for legibility on the 
composite scale. This correlation for nor mal 
handwriting samples is comparable to that ob- 
tained for the children and adolescents. It is pos- 
sible that had samples of normal, fast, and slow 
writing for children and adolescents been gathered 
and placed in rank order of legibility, correlations 
of a magnitude similar to those obtained for the 
mixed writing sam ples of the college population 

i have resulted. 
ш анале Between Motor Control and Legi- 
pility—It will be recalled that upon completion of 
the standard sentence each pupil was given а bat- 
tery of eight motor control tests (see Figure 2) 
which involved movements of the hand and fingers 
similar to but not identical to those used in hand- 
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writing. Pressure records were secured on all 
motor control items. Intercorrelations among the 
samples of force variation ratios for the eight 
items were consistently high, ranging from . 609 
to .948. The two items which correlated most 
highly with each other and with all others were the 
motor control items (0 and (0. As may be seen in 
Table XIV, the correlations between the force var- 
iation ratios (FVR’s) for these two items ranged 
from .705 to . 719. Since these two test items in- 
volve rather basic movements in letter formation 
and since the resulting FVR’s are moderately 
highly correlated, it was decided to use the sum 
of the FVR's as the measure of motor control. 

In order to examine further the stability of the 
force variation ratios, the variability in point 
pressure of the words ‘‘fox jumps’’ was correlat- 
ed with the sum of the FVR's of the two motor con- 
trol items. These correlations range from . 353 
to .539. It is interesting to note that while these 
correlations are not high, they are all positive and 
two of the three are significant at the 1 percent 
level and the other is significant at the 5 percent 
level. 

The consistently positive correlations among 
these data suggest that these may be indicative of 
a definite motor function common to both tasks. 
The fact that these combinations are not as high 
as the correlation of .81 attained in a similar set 
of tasks in the college study may mean that a sta- 
bilized motor pattern for handwriting is not 
achieved until college age. 

The correlations between legibility and motor 
control shown in Table XIV are negligible. These 
findings are in agreement with the correlations 
reported in Table XIII between legibility and the 
force variation ratio. А 

Legibility and Force Variation Ratioas a Func- 
tion of Writing Speed—In order to study the effects 
which variations in the individual's rate of writing 
might have upon the relationship between legibility 
and variability in point pressure, data were ob- 
tained on another sample of 20 fourth grade chil- 
dren. These children wrote the standard sample 
at their usual rate of writing, ata slower rate than 
normal, and a rate which was substantially faster 
than normal. The results obtained for these 20 
children are shown in Table XV. In arriving at 
the figures included in Table XV, an index value 
of 1.00 was taken as the average legibility and av- 
erage force variation values at the normal rate of 
writing. Note that the values listed under the head- 
ings Slow and Fast are percentages of the mean 
values for the normal writing rate. For example, 
an index value of less than 1.00 indicates legibili- 
ty better than that produced at norm al writing 
rates, and a value higher than 1. 00 is indicative 
of legibility poorer than that achieved at normal 
writing rates. Conversely, a variability index of 

less than 1.00 indicates relatively low force уагі- 
ability, whereas an index greater than 1.00 is 
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TABLE XIV 


INTERCORRELATIONS OF MOTOR CONTROL DATA AND CORRELATIONS OF 
LEGIBILITY RATINGS WITH MOTOR CONTROL 


4th Grade 6th Grade 10th Grade 
N - 48 N - 46 N = 47 
FVR of (0 with FVR of (0 . 719+ . 705% . 707% 
> (0 (0 with Fox Jumps . 409* . 539% ,959** 
> (0 (0 with Legibility Rating . 109 211 · 009 


аа И И a aaa 
* Significant at the 1 percent level 
**Significant at the 5 percent level 


TABLE XV 


LEGIBILITY AND FORCE VARIATION RATIO AS FUNCTIONS OF WRITING SPEED AT THE FOURTH 
GRADE LEVEL, USING THE AVERAGE FORCE VARIATION RATIOS OF THE NORMAL 
SAMPLES AS THE REFERENT 


ке ек к  __—‏ ا س“ 
Fast‏ 


Slow Normal 


Percent of Normal Percent of Normal 


Adult 1.55 
Adult TT | | 
Legibility с 'gi high 1.00 Child 1.29 low 
ht акы el ب ی چ ی‎ ee а 
Adult 1.52 
Variability in =A T A i.n eed -4 Ба 
Force Child 94 . 
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evidence of higher force variation ratio than at 
the normal writing rate. Data from the с ollege 
population were handled in the same way, using 
the averages of the samples produced at normal 
writing rates as the basis for computing indices 
for legibility and variability at slow and fast writ- 
ing rates. Itis interesting to note that complete 
agreement was obtained between the child 
and adult samples for both legibility and var- 
iability when the sub ject movedfrom the norm- 


al rate of writ ing toeither the fast or the slow 
rate. 


It is evident that when the legibility and the 
force variation ratio вс 


percentage of the score 
ing rates, high legibili 


Conclusions 
22252091005 


This study has tested the hypothesis {аек 
bility and force variation ratio are not signif- 
icantly related in a child and adolescent popula- 
tion. The evidence presented in this study 4 
ported this hypothesis under conditions in nae 
the children were asked to write at their usu 
rate of writing. While sex and grade ашгаасаа 
in legibility were evident, these differences cow 
not be attributed to differences in variability + 
point pressure, However, the data showed tha 
when either children or adults moved away seni 
their normal writing tempo, high legibility ten dde 
to be associated with low variability in урин 
of force and poor legibility was associated и 
high variability in point pressure. In this sens : 
the original hypothesis was not s upported fo 
either children or adults. 5 

The findings suggest that each individual hps 
his own pattern for speed of writing and for E 
ability in the application of force, These я: | а 
be considered to be rather basic components "e 
motor set in handwriting. If speed of handwriti Е 
is increased, variability in application of ppt 
likewise increased, the motor set is disturbed а 
handwriting legibility is adversely affected. ва 
ing ве Committee in Handwriting is now pom 
Ing the scope of its investigation to include oth g 
motor and perceptual aspects of the h andwriting 
process. It is felt that the present study has ей 
vided additional evidence of the importance шээж 
: ntion upon the individual and the stabi ris 
of his performance under varying conditions. Pr 
ould likewise appear to be a worthwhile focus s 
Studying learning Problems in other basic skill 
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Introduction 


OVER THE past decade the development of new 
techniques of measurement in the general area of 
group dynamics has lagged behind the ever- 
increasing emphasis on researchinthis field. Al- 
though a few investigators have devised special 
methods for measuring some of the important 
elements of group interaction (1, 5), most re- 
searchers have, like Lippett and White, invented 
their own unique ‘‘test situations’’ and then em- 
ployed the more familiar method of verbatim re- 
cording followed by detailed analysis (4) This 
paper describes an attempt which has been made 
to devise a “standard situation" in which defined 
elements of group interaction can be systemati- 
cally observed, recorded, and scored. 

The test is an outgrowth of the work done by 
the Mid-Century Committee on Outcomes in Ele- 
mentary Education (2), and as such is geared to 
the measurement of social relations skills among 
elementary schoolchildren. It is believed, how- 
ever, that the technique possesses wide applica - 
bility in areas other than the one for which it was 
originally devis ed, since it has been used suc- 
cessfully with teen-age as well as with adult 
groups in avariety of special fields. But, as de - 
scribed in the following pages, the test is pre- 
sented as a possible research tool which might 
provide data to answer, at leastin part, many of 
the important methodological questions which 
present-day educational techniques and practices 
raise. 


The Test and its Administration 


The Russell Sage Social Relations Test is a 
situational test created to assess the nature and 


ж 


* The development of this instrument w: 


жжТһе writer is in 
contributions to 


quality of two important aspects of elementary 
school children’s skill insocial relations: (1) skill 
in cooperative group planning procedures, and (2) 
skill in techniques of cooperative group action. It 
is designed for children in grades three through 
six and occupies approximately one hour of time. 
Its administration requires a trained examiner 
and a trained observer. 

The test consists of three constr uction-type 
problems, graded in dif ficulty and administered 
one after the other. For each problem the chil- 
dren are provided with thirty-six interlocking con- 
struction blocks of various shapes and colors and 
a model which they are to copy exactly. All thirty- 
six blocks are necessary for the construction of 
each model, there being none left over when the 
problem is completed. Ав shown in Figure 1, the 
first problem requires the children to build the 
figure of a house; the second, asimple footbridge; 
and the third, the figure of a dog. 

The test is administered to an entire class - 
room group of children at once. In the course of 
the administration, the examiner tells the children 
that the purpose of the test is to see how well they 
can work together. He tells them that the test does 
not provide scores for individual pupils, but only 
one score for the class as a whole. Pupils may, 
therefore, help each other, discuss things freely, 
and work cooperatively on the problem. He pro- 
vides eachchildinthe room with one or two of the 
blocks needed to build the particular model shown 
them. He then tells the group that, although they 
can have all the time they need to plan how they 
are going to go about it, they will be allowed only 
fifteen minutes for the actual construction. Final- 
ly, the examiner informs the group that their 
score on the test depends upon the time it takes 


as made possible by a grant from the Russell Sage Foundation. 
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them to build the model, the shorter the time the 
higher their score. 

Having explained the test and its rules to the 
class, the examiner allows the pupils to proceed 
with the task of developing a planof action. When 
this plan is completed and all members of the 
group have it clearly in mind, hetells them to be- 
gin construction of the first model andimmediate- 
ly starts timing. Hestopshis watch when the fin- 
al block is correctly placed and announces the 
score to the class. He then proceeds with the 
second and more difficult problem. The class 
is again given time to plan and organize itself as 
in the first problem. The third and most difficult 
problem is administered in like manner. 

As mentioned above, the test is designed to 
provide information abouttwo different aspects of 
social relations skill. The first is the ability to 
participate in group discussions of plans for fu- 
ture group action; the secondis the ability to reg- 
ulate one's behavior in accordance with these 
plans so that movement to wa rd the agreed-upon 
group goal is facilitated. The test is thus divid- 
ed into two separate parts. The first part is 
termed the Planning Stage—the period during 
which the children devise a plan of action. The 
second part is called the Operations Stage—the 
period during which the group puts its plan into ef- 
fect and actually builds the model, using the 
blocks provided by the examiner. 

In both stages the aim is to obtain a measure 
of the best possible performance the group is cap- 
able of rendering. This aim is accomplished by 
having the examinerplay a precisely defined and 
standardized role inhis administration. His ini- 
tial task is to explain the test and its rules to the 
children. In fulfilling this task his actions and 
statements are the same for all groups. His sec- 
ond and more complicated assignment is to create 
and maintain an atmosphere in which optimum 
group planning is possible. To achieve this pur- 
pose he must vary his behavior in accordance 
with the needs ofthe group. That is, a noisy 
group must be held in check so that communica- 
tion and discussion of ideas is possible; a quiet 
solemn group must be encouraged to express ideas; 
a group that getslost in the complexity of its own 
ideas must be rescued and its thinking clarified. 
Throughout all this the examiner must refrain 
from providing the children with his own ideas of 
ways of solving the problem and from telling them 
which of the ideas they have suggested are good 
or poor. He mustatall times maintaina permis- 
sive atmosphere in which the giving of ideas is 
encouraged and allsuggestions warmly accepted, 
but in which no positive direction is supplied. In 
such an atmosphere the children can perform 
only at the lev el at which they themselves are 
presently capable, and the quality of their plan- 
ning thus provides a measure of the amount of 


skill which they possess. 


The examiner's final task is toinsure that, be- 
fore entering the Operations Stage, each child in 
the group clearly understands the plan which has 
been devised. Therefore, he repeats verbatim the 
details of the plan which the class has built and 
asks if there are further questions before they be- 
gin the construction task. Hereagainthe examin- 
er must vary his behavior with the needs of the 
group, answering questions on points which the 
group has already decided, and referring back to 
the group any questions about points which they 
have nottaken into account. It is not until the chil- 
dren tell the examiner that they are all ready to 
begin work that he permits them to start. This 
insures that allgroups enter the Operations Stage 
with presumably equalclarity concerning the plan 
that they as a group have built. 

During the Operations Stage the role of the ex- 
aminer is also standardized. After the signal to 
start is given he withdraws to one side of the class- 
room and refuses to answer questions or provide 
help, even when requested. Regardless of what 
the childrendo (however noisy, rough, boisterous, 
or quarrelsome they become, or however quietly 
and efficiently they work), he neither reprimands 
nor commends them. He interferes only if behav- 
ior gets so out-of-hand that physical harm to chil- 
dren is apt to result. Ifthis occurs, he stops the 
test and no further problems are given. 

The mechanics of administering the test place 
the children in a situation where progress toward 
the goal is not possible without some type of coop- 
eration on the part of each child in the group and 
where external controls designedto elicit particu- 
lar kinds of behaviors and suppress other kinds do 
not exist. In such a situation the children can 
show the extentto which they arecapable of carry- 
ing out a plan of action they themselves have de- 
vised, and the kind of behavior they exhibit pro- 
vides a measure of the amount of skill they pos- 
sess in this respect. 

During the time the childrenare engaged in the 
task of planning how to solve the problems and in 
working on the actual construction of the various 
models, the observer (seated near the back of the 
classroom) keeps a record of their behavior using 
standardized observation sheets. The data thus 
collected are later transformed into a set of nu- 
merical scores which are used to rank groups on 
the basis of their skill in cooperative group plan- 
ning procedures and in techniques of cooperative 
group action. (A manual containing complete di- 
rections for administering and scoring the test 
can be obtained from ETS.) 


Empirical Development of Scoring Procedures 


The methods of scoring which are discussed in 
a subsequent section are the result of two years' 
research. Table I shows the large-scale adminis- 
trations that have been completed, but fails to il- 
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TABLE I 


EXPERIMENTAL ADMINISTRATIONS OF THE RUSSELL SAGE 
SOCIAL RELATIONS TEST 


=) 


No. of Parti- 


No. of 
Date cipating Schools Grade Classes 

November 1953 One Third 4 
Fourth 4 

Fifth 4 

February 1954 One Third 4 
(retest) Fourth 4 

Fifth 4 

April 1954 Fifteen Fourth 10 
Fifth 10 

Sixth 10 

November 1954 Twenty-one Third 24 
Fourth 13 

Fifth 9 

Sixth 3 

May 1955 Twenty-one Third 24 
(retest) Fourth 13 

Fifth 9 

Sixth 3 
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lustrate the fact that after each administration 
the observation, recording, and scoring tech- 
niques employed underwent drastic revisions. 
Each revision represented yet another attempt to 
break away from the familiar framework of scor - 
ing individuals and to devise a reliable and mean- 
ingful system for scoring groups. 

Such attempts proved to be unusually difficult 
since all previous research in this general area 
has employed techniques which describe segments 
of individual behavior, either by means of a check 
list such as that devised by Bales (1) or by means 
of verbatim recordings of individuals’comments, 
attitudes, actions, etc., as employed by Lippitt 
and White (4). All attempts to apply such tech- 
niques proved unsatisfactory from the start. In 
the first place, the groups dealt with were so 
large that in focusing upon the behavior of specif- 
ic individuals the interaction among individuals 
was lost sight of. Secondly, the varieties of in- 
teraction which occurred indifferent groups were 
so complex that individual recordings failed to 
catch much ofthe important information. Finally, 
a primary aim was to develop an instrument which 
teachers themselves could use with their classes; 
hence complex techniques which required an un- 
due amount of time, effort, special training, and 
expense could not be used. 

Further search of the literature revealed that 
most research on groups of the size in which we 
were interested used sociometric ratings and 
specially designed questionnaires. The few spec- 
ial techniques that had been devised were similar 
to Pepinsky's Group Participation Scale (5), a 
modified ‘‘guess-who’’ instrument which not only 
depended heavily upon the intelligent cooperation 
of the subjects but produced a limited picture of 
only one aspect of group interaction. 

Two important guideposts determined the gen - 
eral direction which the development of observa- 
tion, recording, and scoring techniques for the 
present test should take. The first of these was 
the definition of the ‘‘good’’ group implicit in the 
Social Relations section of Elementary School Ob- 
jectives (2). The second was the general theory 
of behavior outlined by Krech and Crutchfield (3), 
particularly those parts of the theory which relat- 
ed to frustration and aggression. We discovered 
that “good” groups, although few in number, did 
exist and were readily distinguishable from “not 
good” groups. To extract the simple variables 
that reflected these differences ina precise man- 
ner proved a lengthy and time-consuming task. 

The November 1953 administration was соп- 
ducted to see whether the test would “work” 
with elementary school children. The results 
were quite satisfactory, children being both inter- 
ested in and motivated by the problems. Follow - 
ing this administration a beginning was made on 
the development of recording techniques which 
would successfully capture, and scoring tech- 


niques which would accurately reflect, the wide 
differences in the test behavior of different groups. 
Tentative procedures were devised and tried out 
in a retest of the same groups in February 1954. 
Although this retest pointed up innumerable flaws 
in the methods so far developed, sufficient data 
were accumulated to show that over this three- 
month interval some of the groups had improved 
their test performance, whereas others had dete- 
riorated noticeably. А search for the reasons 
which lay behind such changes led to the formula- 
tion of the following hypothesis: changes in test 
behavior over time are a function of: (1) the nature 
of the teaching methods ateacher employs in her 
classroom, and (2) thekinds of controls the teach- 
er uses to make her methods effective. 

To test this hypothesis, the cooperation of the 
elementary school supervisor in a large city sys- 
tem was solicited. The supervisor was made ac- 
quainted with the test and with our ideas regarding 
what it measured and was asked to select for us 
five pairs of teachers at each of three different 
grade levels, fourth, fifth, and sixth. One mem- 
ber of each pair was to be a teacher who placed 
great emphasis upon teacher-pupil planning and 
group work. The other member of the pair was to 
be a teacher who placed major emphasis upon indi- 
vidual instruction and who did not use group plan- 
ning techniques with her children. Since we had 
no advance knowledge as to which member of a pair 
was which, wefelt that if the social relations test 
could discriminate between the two it would indi- 
catethatourhypothesis was a tenable one. Of the 
thirty classes tested in the April 1954 administra- 
tion, twenty-four were predicted correctly. The 
six incorrect pr edictions were found to be due in 
large part to errors and ambiguities which still 
existed in our recording and scoring procedures. 
Accordingly, all procedures were once again sub- 
jected to a few major and several minor revisions. 
The revised techniques were pretested on classes 
in several near-by county schools and were found 
satisfactory. 

The final step was to determine if others could 
be trained to use the test and obtain results which 
corresponded to ours. То answer this question 
we accepted an invitation to participate in a com- 
prehensive teacher evaluation study being conduct- 
ed by the Division of Teacher Education in New 
York City. Thefifty classes tested in the Novem- 
ber 1954 administration and retested in May 1955 
provided opportunity to experiment with the train - 
ing of four different teams of examiners, one ad- 
ministrator and one observer per team. 

The November administration was dishearten- 
ing, for we found that several of the behaviors to 
be observed and recorded were still not defined 
with sufficient objectivity to render them reliably 
Scorable by persons not intimately acquainted 
with the test. TheretestinMay, however, proved 
more successful, probably because between the 
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November and May administrations much effort 
was devoted to spelling out in more precise be- 
havioral terms the variables we were interested 
in. It wasfound that a person with the equivalent 
of a bachelor's degree in education who has had 
some teaching experience can be trained to admin- 
ister the test in approximately two days, working 
with two or three classes each day. Observers, 
however, require approximately four or five days 
of training at the rate of at least two classrooms 
per day. Funds were not available for having two 
observers in each classroom, hence reliability 
data could not be collected. The writer worked 
with each observer until an estimated reliability 
of .80 was obtained on at least two classes. The 
observer was then on his own. 

Despite the fact that continuous experimental 
revisions preventedour obtaining and reporting a 
Series of coefficients descriptive of the reliability 
and validity of the test, it is felt that valuable evi- 


dence of a nonstatistical and Subjective type can 
be reported. 


1. Variations in class size ranging between 20 
and 30 seem to have no effect upon chil- 
dren's test behavior. If a group is larger 
or smaller, test behavior apparently is al- 
tered in direct proportion to the amount of 
departure in size from this range. For ex- 
ample, communication is much easier ina 
group of 6 or 7 persons than ina group of 
40 or more persons, but of app roximately 


equal difficulty in groups of 20 and groups 
of 30. 


- Persons with some educational background 
can be trained easily to administer the test 
using the standardized procedures which 
аге necessary for adequate and reliable as- 
Sessment of group behavior. 


3. Definite evidence has 
that children's behay 
reflects in some degr 
ity of the education 
by the teacher, 
ports, but is dir 
sults of the Lipp 
onthe social cli 


been collected Showing 
ior in the test situation 
eethe nature and qual- 
al experiences provided 
This finding not only sup- 
ectly in line with, the re- 
itt and White experiments 
mate of children’s groups (3). 


4. Tentative evidence has bee: 
support of the hypothesis t 
trative set-up of a schoo 
portant d et e rminant of children’s test be- 
havior than the classroom methods em- 
ployed by aparticular teacher, The scores 
of classroom groups of children Seem to 
vary less within schools than they do be- 
tween schools. A school with a warm 
friendly principal, whose teachers appear 
to have а great deal of individual freedom 


п collected in 
hat the adminis- 
l is а more im- 


and good inter-staff relationships, tends to 
have classrooms whichscore relatively high 
on the test. A school in which principal and 
teachers are stiff, cold, fearful of the test 
itself, and aggressive toward the examiners 
tends to have classrooms whichscore rela- 
tively low. 


5. Retesting children after alapse of from three 
to six months and using the same problems 
has been found to be a feasible procedure. 
The second administration is always given 
with the instruction, **We would like to see 
how much better you can do this time, and 
with but rare exceptions children enter into 
the task as eagerly and as cooperatively as 
they did at first. Changes in behavior from 
one administration tothe next apparently re- 
flect the nature of the classroom learning 
experiences children have been exposed to 
in the interim, rather than learning how to 
take the test. 


6. Quite by accident it has been found that wa 
many ways the test is as applicable for үнэн 
groups as for children’s groups and can, wi 
appropriate modifications, be scored in the 
same way. We have found, for example, 
that an adult group unskilled in tec h niques 
of cooperative group planning Scores defi- 
nitely below a third-grade group which has 
received training along these lines, and that 
an adult group highly skilled in such tech- 
niques surpasses a highly skilled children 5 
group only inthe complexity and sophistica- 
tion of their final plan of action. 


The problem of scorer reliability, however, 15 
Still unsolved and it is felt that, before the tech- 
niques described in the 1011 owing section can ђе 
used with confidence, they must be subjected to 4 
detailed and rigorous statistical analysis. Re 
search on this problem is now in progress. 


Recording and Scoring Procedures 
coring Procedures 


Since the types of scores a situational test pro- 
vides are determined by the specific behaviors 
which are observed and recorded, a first step in 
scoring is that of setting up behavioral definitions 
of the skills the test is designed to measure. 

On the basis of the material presented in the 
Section on Social Relations in Elementary School 
Objectives (2), as well as on that found in the lit- 
erature of education and of group dynamics, а 
group skilled in cooperative planning tec hniques 
can be said to be one characterized by a wide - 
Spread, interested exchange of appropriate ideas 
in which definite growth in the direction of great 
er clarity and precision of thinking occurs 28 ын 
discussion progresses. In addition, the skille 


= 
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group possesses asubstantial amount of autonomy 
in that ће members can, without external help or 
restraint, conduct themselves during the discus- 
sion in such a way that progress toward their 
goal of devising a mutually satisfactory plan of 
action is implemented rather than impeded. A 
group skilled in techniques of cooperative group 
action can be said to be characterized by a har- 
monious working atmospere in which there is 
widespread and interested concern on the part of 
all members regarding rapid and efficient pro- 
gress toward the group goal. 

Using these definitions as guides, two sets of 
relatively independent variables (one for the Plan- 
ning Stage and one for the Operations Stage) were 
developed for scoring purposes. For the Plan- 
ning Stage the variables are: 


1. Participation—The extent to which individ- 
ual children in the group enter actively in- 
to the planning discussion. In a skilled 
group many children participate; in an un- 
skilled group few children participate. 


2. Involvement—The extent to which children 
exhibit interest in and concern for the task 
at hand. In a skilled group the majority of 
the children evidence an active and sus- 
tained interest in the planning. In an un- 
skilled group interest is quite low and may 
be completely lacking. 


3. Communication— The way in which children 
exchange ideas during the discussion per- 
iod. Ina skilled group children listen to 
each other, critically and constructively 
evaluate each others ideas, and integrate 
simple ideas to build impro ved and more 
complex ones. In an unskilled group chil- 
dren act independently, and there is no real 
or fruitful exchange of ideas and sugges - 


tions. 


4. Autonomy— The extent to which children сап 
discuss the problem and reach a point of 
decision about a final plan without the help 
or restraint of the examiner. The skilled 
group is almost completely independent of 
the examiner. Theunskilled group requires 
continuous help and/or restraint to get 
through the planning period. 


5. Organizational Techniques— The kinds of 
ideas children have about ways of organiz - 
ing themselves to attack the problem. Ina 
skilled group the major part of the discus- 
sion is centered around concepts of leader- 
ship and organized subg r oups. In an un- 
skilled group the discussion centers around 
the general idea of having each child work 


as an individual. 


6. Final Plan— The quality of the plan which the 
children eventually devise. In a skilled 
group the plan is precise and detailed, with 
every child in the group kno wing exactly 
where he is to go and whatheis to do. In an 
unskilled group the plan is essentially non- 
existent, being so vague and indefinite that 
no child in the group has any clear idea of 
what his responsibility, as an individual, is 
with regard to building the model. 


For the Operations Stage the variables are: 


1. Involvement — The extent to which children 
exhibit interest in and concern for accom- 
plishing thetaskat hand. In a skilled group 
the majority of thechildren evince sustained 
interest in the progress being made either 
by themselves or by those who are engaged 
in doingthe building. In an unskilled group 
many children withdraw from the field alto- 
gether and pursue non- problem-centered 
activities. 


2. Atmosphere— The psychological tone-qu al- 
ity of the group of children who remain in 
the problem field as revealed by the kinds 
of statements the children make and by the 
tone of voice in which the statements are 
made. In the skilled group, problem-solv- 
ing is carried on in a warm, friendly, sup- 
portive, and harmonious atmosphere. In 
the unskilled group Ше atmosphere can range 
from an extreme and unnatural quiet to 
screaming excitement or open fighting. 


3. Activity— The kind of behavior children who 
have withdrawn from the problem field en- 
gage іп. In the skilled group few or no chil- 
dren withdraw, and the few who do engage 
in acceptable classroom behaviors. In the 
unskilled group many children tend to with- 
draw, and the majority of them participate 
in highly unacceptable classroom behaviors. 


4. Success— The time necessary for the group 
to complete the task of building the model. 
The skilled group can complete the task in 
ten minutes or less. The unskilled group 
seldom finishes within the allotted fifteen 
minutes, or their behavior becomes so out- 
of-hand that the test must be stopped before 
the fifteen minutes are up. 


Space does not permit a detailed report of the 
standardized directions which have been developed 
concerning the specific behaviors which are ob- 
Served, the way in which they аге recorded, and 
the way in which they are combined to provide nu- 
merical scores on each of these ten variables. 
For present purposes it must be sufficient to state 
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that on the basis of the data recorded on the ob- 
servation sheets every group taking the test re- 
ceives а score of from 1 to 5 on each variable, 1 
representing the poorest score and 5 represent- 
ing the best score. These scores are then сот- 
bined to yield а single descriptive rating which 
provides a meaningful picture of the group’s so- 
cial relations skill. 

In the beginning phases of the research an at- 
tempt was made to combine the six variables of 
the Planning Stage and the four variables of the 
Operations Stage in such a way that groups could 
be ranked from high to low along each of two con- 
tinua designated as skill in cooperative group 
planning and skill in cooperative group action. It 
was soon found, however, that a simple ranking 
of this type disregarded much of the information 
that was deemed educationally and psychologi- 
cally significant. Тһе reason was ап obvious one. 
Skill in cooperative group planning does not con- 
sist of one but rather of several interrelated and 
interdependent component skills. The same is 
true of skill in cooperative group action. Groups 
can (and do) vary from each other in any one, two, 
three, or all of these component skills as well as 
in different combinations of them. It became es- 
sential, therefore, to devise a conceptual frame- 
work for describing differences in the behavior of 
groups that could handle variations in several 
variables rather than variations in a single vari- 
able. 

Study of the test performance of many differ - 
ent groups of children revealed that in each stage 
of the test certain variables acted as limiting fac- 
tors on others, These factors, termed limiting 
variables, were of such a nature that if a group 
received low scores on them the group was pre- 
vented from receiving high scores on any of the 
other non-limiting variables. The converse was 
not true. Groups who received high scores on 
these limiting variables did not nece ssarily re- 
ceive high scores on the non-limiting ones, but 
could range in score from very poor to very good, 
This finding eventually led to the postulation of 
two axes, one composed of the limiting variables 
and the other composed of the non-limiting ones. 
These axes were then alignedin such a way that a 
group’s position on the one determined the range 
of positions it could occupy on the other. 

Examination of the psychological nature of the 
variables which determine a group’s position on 
these two axes revealed that the limiting variables 
were related to the amount and kind of discipline 
children exhibit in the test situation, The non- 
limiting variables concerned the amount of tech- 
nical knowledge children seemed to Possess about 
how to organize themselves to accomplish a group 
task. The reason for the somewhat peculiar re- 
lationship between the several variables which go 
to make up skill in social relations now becomes 
clear. The technical knowledge which members 


of agroup possess cannot be put to use unless 
members also possess ап adequate amount of self- 
discipline. That is, children (and adults too, for 
that matter) might be aware of what ‘‘ought’’ to be 
done in order best to accomplish a specific group 
goal, but, unless each individual conducts himself 
in a manner conducive to such acco m plishment, 
optimum progresstoward the goal becomes ПВА 
possible. Оп the other hand, effective sel {-dis- 
cipline on the part of group members cannot, in 
and of itself, serve to make up for lack of the tech- 
nical ‘‘know-how’’ necessary for maximum group 
efficiency. The extent to which a group is able to 
utilize effectively the technical knowledges and 
skills which its members possess is thus limited 
by the extent to which the members are self-dis- 
ciplined. 

Figure 2 shows the way in which two axes are 
aligned in the Planning Stage and in the Operations 
Stage of the test. The horizontal axis has been 
designated Discipline, and a group's placement on 
it is determined by scores received on thé limit- 
ing variables in each stage. The vertical axis 
which bisects it has been designated Efficiency 
(Organizational Efficiency in the Planning Stage 


and Operational Efficiency in the Operations Stage). 


A group's placement on this axis is determined by 
the nondimiting variables in each stage. The 
dashed restraining lines in the figure are purely 
illustrative, serving to indicate that a group's dis- 
tance in either direction from the mid-point of the 
horizontal axis limits, but does not wholly deter- 
mine, the height to which the groupcan rise on 
the vertical axis. 

In the Planning Stagethe variables Involvement 
and Autonomy were found to be limiting ones. In- 
volvement, it will be recalled, is a measure of to- 
tal group interest, and Autonomy is a measure of 
the extent to which a group can function independ- 
ently of the examiner. Onlya group whose mem- 
bers possess a high degree of self-discipline has 
the ability to get th rough the planning discussion 
without examiner interference. Group me m bers 
will maintain interest in the planning discussion 
only to the extent that they feel themselves to be 
needed, wanted, and important members whose 
ideas are worthy of attention and consideration bY 
others. The more this feeling is present, the 
more self-disciplined a group tends to be and the 
more it is able to function independently. Simi- 
larly, the more self-disciplined a group is, the 
more its members tend to consider, as well ae 
seek, the ideas and opinions of others, Autonomy 
and Involvement are thus seen to be inextricably 
related, and without these two conditions the ргоѓ- 
itable exchange of ideas necessary to good grouP 
planning is impossible. 

As Figure 2 suggests, agroup’s lack of Auto. 
omy in the Planning Stage can manifest itself ! 
different ways. Groups can be un-disciplined in 
the sense that all forms of control, both extern 
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Organizational Efficiency 


Discipline 
Zero Max. Zero 
Un- Disc. Self - Disc. Over- Disc. 
(Rowdy) (Task- Centered) (Quiet) 
THE PLANNING STAGE 
Operational Efficiency 
Discipline Jim Mai: Zero 
- Disc. Self - Disc. Un- Disc. 
(NOR Hostia) (Task-Centered) (Hostile) 


THE OPERATIONS STAGE 


FIGURE 2 


SCHEMATA USED TO SUMMARIZE THE DATA OBTAINED FOR 
GROUPS IN THE PLANNING AND OPERATIONS STAGES OF 
THE TEST 
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and internal, are absent, or they can be over- 
disciplined in the sense that external controls 
have been so strongly imposed that the develop- 
ment and/or functioning of self-discipline is not 
possible. In the test situation the completely un- 
disciplined group is a rowdy, noisy, restless one 
in which children pay little or no attention either 
to the examiner or to the task at hand. The ex- 
tremely over-disciplined group, onthe other hand, 
is one in which children are so restrained and 
passive that the examiner can elicit almost noth- 
ing from them. Both groups, inthe extreme case, 
are almost totally lacking in interest and involve- 
ment in the test problems, and this combination 
of factors prevents them from functioning effi- 
ciently during the planning period. 

In the Operations Stage of the test the limiting 
variables were found tobe Atmosphere and Activ- 
ity. Atmosphere refers to the Psychological 
tone-quality of the group of children who remain 
in the problem field and who exhibit concern for 
the construction task. Activity refers to the kind 
of behavior children who һауе withdrawn from the 
problem field engage in. A harmonious working 
or problem-solving atmosphere in which optimum 
progress is possible can, in the present test sit- 
uation, be created and maintained only if each 
group member subjects himself to the rules of the 
plan devised during the discussion period and re- 
frainsfrom engaging in any behavior which will 
impede progress toward the group goal, 
members who have withdrawn from the p 
field must likewise refrain from engaging 
haviors which are so rowdy or so hostile th 
disrupt the work in progress. Atmosphere and 
Activity, therefore, are concerned with the amount 
of self-discipline group members possess, and 
directly affect the efficiency with which the gr 
as a whole can accomplish its task. 

Lack of self-discipline in the Operations Stage 
results in one of two different types of behavior, 
both of which are equally detrimental to efficient 
group accomplishment. As shown in Figure 2, 
children who are undisciplined can evince hostile 
or non-hostile behaviors. In the extreme case 
the undisciplined non-hostile group is опе in which 
the classroom is turned into a playground, chil- 
dren running about with complete abandon. On 


the other hand, the group in which members 
hibit extreme hostility tow: 


in which angry quarrelin 
dominate. In both group: 


Group 
roblem 
in be- 
at they 


oup 


ex- 
ards each other is one 
g and open fighting pre- 
5 Ope rational efficiency 
is so low that it frequently is necessary to stop 
the test within the first few minutes, 

All four of the non-limiting variables which de- 
termine a group’s placementonthe Organization- 
al Efficiency line in the Planning Stage relate to 
those technical skills and knowledges essential to 
building a good plan of action, Since each child 
in the room has one or two oft 


he blocks neces- 
sary to build the puzzle, it is obvious that some 


type of organization is essential to efficient ac- 
complishment of the construction task. И children 
have no knowledge of organizational techniques 
such as those of delegating leadership, appointing 
subgroups to perform certain tasks, or specifying 
when and where different groups are to work, a 
good final plan cannot be built. Possession of such 
knowledge, however, is not by itself sufficient. 
Children must also possess the ability to express 
their ideas, to communicate them to others, to 
listen to the comments and criticisms which others 
might make, and to engage іп that fruitful ex- 
change of ideas and points of view which results 
in the most precise, the most comprehensive, and 
the most detailed plan of action. The extent to 
which group members possess these knowledges 
and skills is reflected by scores on the variables 
Communication, Participation, Ideas, and Final 
Plan. | 

In the Operations Stage the non-limiting vari- 
ables are Success and Involvement. Success is а 
highly objective measure of efficiency based upon 
the time it takes the children to build the model; 
it is dependent upon both the excellence of the plan 
and the discipline with which it is carried to com- 
pletion. Involvement, although still a measure of 
total group interest, is not a limiting variable in 
this stage as it is inthe Planning Stage. This 15 
because skill in techniques of cooperative action 
is revealed in the extent to which children, when 
placed ina completely free and unstructured situ- 
ation, can exhibit and maintain genuine concern 
for accomplishing a group goal. This concern 
may be manifested directly or indirectly. Direct 
Involvement is exhibited by children who remain 
in the vicinity of the problem table, following 
closely the progress being made on the task, and 
also by children who, al though drawing slightly 
apart from the builders, obviously keep an eye ОЛ 
the progress being made, Indirect Involvement 1$ 
evidenced by children who; quietly confident that 
in the Planning Stage they, as a group, have made 
the best possible plan of action, pursue other on- 
£oing classroom activities andleavethosefew who 
have been delegated to perform the construction 
‘ask to carry out their responsibility. 

It is possible for аргоцр to receive ahigh score 
on Success (indicating that the problem was solve 
in a relatively short period of time) because the 
majority of the children refused to accept ће prob- 
lem and withdrew from the field altogether, leav- 
ing the job in the hands of three or four grouP 
members. In such cases the problem is obvious 7 
ly no longer а "group" problem апа the tot 
group's rating on Operational Efficiency is low 46 
Spite their high Success score. Success, mer 
ured in number of minutes it takes to solve ЇЇ 
problem, thus becomes a meaningful measure m 
“skill in cooperative group action’’ only when t 5 
Involvement score is high, indicating that the шин 
jority of group members remained interested ! 
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the accomplishment of the task. For this reason, 
Operational Efficiency must be determined by 
scores on both Success and Involvement variables. 

To summarize, agroup’s placement on the dis- 
cipline axis is determined in the Planning Stage 
by scores on Autonomy and Involvement, and іп 
the Operations Stage by scores on Atmosphere and 
Activity. Placement on the Organizational Effi- 
ciency continuum is determined by scores on Par- 
ticipation, Communication, Ideas, and Plan, and 
on the Operational Efficiency continuum by scores 
on Involvement and Success. 


Classification of Groups on the Basis 
of Test Behavior 


Every group that takes the test is classified on 
two bases: (1) their Planning Stage behavior, and 
(2) their Operations Stage behavior. This classi- 
fication is made using the theoretical framework 
described above for summarizing the data obtained 
for a group. For ease of communic ation, each 
"'group-type" has been assigned a descriptive 
name, as shown in Figures 3 and 4. The dividing 
lines in these figures are by no means rigid, serv- 
ing only to illustrate the fact that the defined areas 
represent points of concentration in a plane. All 
of these group-types have been observed and re- 
corded empirically. Their major characteristics 
are given below. 


Group-Types in the Planning Stage 


Figure 3 shows the seven major types of groups 
that have been defined for the Planning Stage of 
the test. The characteristics of each type is as 
follows: 


1. The Mature Group—Characterized by good 
ideas about ways of organizing themselves 
and by an excellent communication pattern. 
The discussion is widespread, animated, 
self-controlled and well conducted. Fre- 
quently the examiner can retire altogether, 
leaving the total planning session in the 
hands of one or two student leaders. 


2. The Dependent Group—Charac te r ized by 
good ideas about ways of organizing them- 
selves and by agoodcommunication pattern. 
This group, however, possesses less auton- 
omy than the Mature Group, and to get them 
through the discussion the examiner has to 
provide frequent assistance in the form of 
summarizing ideas, suggesting next steps, 
reminding them to speak one at a time, etc. 


- ized by ап 

3. The Immature Group Characterize 
alert and eager interest in the test, сош- 
bined with an almost total lack of skill in 
group problem-solving procedures. Ideas 


about organizational tech niques are to all 
practical purposes non-existent and the 
group generally accepts by consensus the 
first or second idea mentioned, regardless 
of its feasibility. The active interest, how- 
ever, serves to keep the group well disci- 
plined and little in need of examiner aid or 
restraint. 


4. The Semi-Controlled Group—Characterized 
by fairly wide and generally interested par- 
ticipation. The communication patternis 
relatively good, as is knowledge of organi- 
zational techniques and qual ity of the final 
plan. Group discussion is markedly imped- 
ed by the fact that group members tend to 
talk all at once. The examiner is always 
hard put to keep the classroom sufficiently 
quiet andthe discussion sufficiently central- 
ized so that childrencanhear each other and 
really develop their ideas. 


5. The Semi-Restrained Group—C haracter- 
ized by an atmosphere of “dutiful recita- 
tion.’’ Children are quiet, polite, attentive, 
and respond readily to the examiner’s re- 
quests. Their ideas are generally good, but 
the planning discussion tends to be stiff and 
artificial, as though children were doing 
what they feel they ‘‘ought’’ to do rather than 
entering freely, wholeheartedly, and with 
enjoyment into the task at hand. The dis- 
cussion is seldom widespread; it is gener - 
ally carried by one, two, or three children, 
the rest remaining quietly passive and disin- 
terested. 


6. The Uncontrolled Group—Characterized by 
a complete lack of discipline, combined with 
a total lack of skill in techniques of planning 
and communication. 


7. The Restrained Group—Characterized by an 
almost complete and highly unnatural si- 
lence, combined with zero skill in techniques 
of planning and communication. The exam- 
iner must work very hard with this group, 
prodding them constantly to get them to re- 
spond at all. 


Group- Types in the Operations Stage 


In the Operations Stage, as in tbe Planning 
Stage, the results of many administrations of the 
test have shown that groups tend to cluster in a 
few areas along the two axes, Discipline and Op- 
erational Efficiency. As shown in Figure 2, lack 
of self-discipline in this Stage of the test can be 
evidenced in either hostile or non-hostile behav- 
ior. The type of behavior exhibited serves to lo- 
cate a group on the right or left side of the tri- 
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Operational Efficiency 
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TYPES OF GROUPS WHICH OCCUR DURING THE OPERATIONS STAGE 


98 JOURNAL OF EXPERIMENTAL EDUCATION 


i E ievement. This group is distinguished from 
ан ملسا اا‎ QU расар the Roviy mid Ше Aen ion E селе в 
ш әр терін = Шы ‘of the triangle are fact that children who have withdrawn den 
cue er ec ada those falling on the right side acceptable (e.g., non-disruptive) classr 
are hostile ones, those Ы pee are best de- activities. 
i k-centered. The lower a group's 
с. на triangle, Ше less efficient it rs as Non-Hostile Groups 
ating unit, and the further away it is from | 
“эй ачин Tine, the less disciplinedare its mem- 4. The Rollicking Groups—C har ac terized des 
bers. As in Ше Planning Stage, we again find that atmosphere which, although basically as 
the amount and kind of self-discipline group mem- is loud, noisy and generally uproarious. RE 
bers possess serves to limit their efficiency in dren not directly concerned with the bui 
accomplishing a particular task. task crowd around the participants shouting 
The nine types of groups shown in Figure 4 are rections, making joking comments apani PaE 
briefly described as follows: gress, and laughing loudly over any err = 
which аге made. The melee generally ен 
Task-Centered Groups too much for some members, and they wit и 
draw from it altogether—not in anger but b oe 
1. The Mature Group— This group is character- self-preservation. The uproar and lack of ser 
ized by a warm friendly, supportive atmos- iousness lowers this group’s over-all efficiency- 
phere in which all members exhibit sustained 
interest in accomplishing the task rapidly and 5. 


The Excited Group—Children in this group аге 


efficiently. Once childr unable to control their excitement over the task. 


en have performed 
jobs assigned them, 


they either socialize free- 


The atmosphere is char ged with Ее, 
ly, but quietly, until the task is completed or voices aresharp and high, and movements "ru 
pursue ongoing classroom activities. This jerky as children work feverishly at the bui Б 
group completes the task in approximately ten ing task. In the excitement tempers y eta 
minutes orless. Itisdis tinguished from all Short and arguments frequent, but real hor E 
other groups by the fact that those children not ity is absent as shown by the fact that outburs а 
directly concerned with the task of putting are task-centered, rather than personal, ап 
blocks together still remain involved in the are short-lived. The nois е, tenseness, an 
work, excitement prevent children from achieving 2 

high success score, although they frequently 

2. The Immature Group—The atmosphere of this manage to finish before the allotted time is Чр: 

group, although basically warm and friendly, 

is best described as “bumbling.” Children 6. The Rowdy Group— This group is similar to ie 

exhibit a sincere concern for accomplishing the Excited Group, except that here all forms 0 

task, but their lack of technical skill in build- control are so lacking that the excitement gets 

ing an efficient plan of action renders their ef - completely out-of-hand. The screaming, ge 

forts ineffective. The majority mill around ning, shouting, and yelling reach such a рис 

the problem table, putting blocks together in a that the exam iner frequently is forced to stop 

more or less random fashion, trading blocks, the test. This group's efficiency is always 

taking them apart and reassembling them, us- near zero. 

ually to little avail. The warmth and friendli- 

ness of the atmosphere, however, prevents the Hostile Groups 

group from breaking up despite their frustrat- 

ing trial-and-error methods This group rare- 7. The Suppressed Group—C harac terized by a^ 

ly completes the problem in the allotted time, almost abnormal quiet 


almost abnormal quiet. 
-fourths finished the building task speak i 
when the fifteen minutes are up. 


murmur, and it is rarely possible to hear d 

о cific statements. Children not engaged in 

3. The Disinterested Group— This group is char- 1 
acterized by extremely 


1 building task sit quietly and idly in their seat? 
low involvement on the doing nothing. 

part of most members in the class, Because 
only a small group work on the Prob 


: lem, the 8. The Вїскегї — Thi is character~ 
working atmosphere is generally a quiet Sie == i ctering Group— This group is 


Children engag ed E 
n whispers or in a 10 


: К 11у: 

y ized by an atmosphere of contained hostilit 
uri Snape um, поемате) цочин time necessary There is a сон илд flow of negatively criti- 
to complete the task, is high. woe the prob- cal arguments about who should be at the ow 
lem is no longer а group problem, the over- lem tabl to be built 
all rating on Operational Efficiency is near ble and how the puzzle ought 


А The hostilit 
zero despite the high degree of Objective Му 


is 
apparent in these arguments 
usually suff 


1 to 
icient to cause many children 
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withdraw from the field altogether with the at- 
titude: ‘‘They think they’re so smart—let them 
do it." Those who remain continue to bicker 
about how the task should be done; those who 
withdraw lose all interest in the work, sitting 
quietly (though often sullenly) attheir desks or 
engaging in some type of play activity. Fre- 
quently, children hide their blocks so that the 
builders cannot complete their task. 


9. The Quarreling Group—In this group the hos- 
tility gets completely out-of-hand. Quarreling 
over possession of blocks and fighting for po- 
sitions at the problem table generally reach the 
point at which the test has to be stopped. 


Changes in Test Performance Over Problems 


The present test, being concerned with the 
measurement of dynamic social and psychological 
forces which reveal themselves in changes in be- 
havior rather than in consistency of behavior, 
does not possess reliability inthe sense of having 
high inter-problem (e.g., inter-item) correla- 
tions. As a group progresses from the Planning 
Stage of problem one to the Operations Stage, and 
from there to the Planning Stage of problem two, 
etc., thenature of the psychological forces active 
in that particular group are revealed by the pat- 
tern of changes which occur in test behavior. It 
is hypothesized that if these forces are predomin- 
antly constructive in nature, the group will be - 
come more cohesive and more efficient over prob- 
lems. Thus a group that scores Immature on 
problem one planning and Mature on problem one 
operations, generally scores Dependent or Semi- 
Controlled in subsequent planning sessions. If, 
on the other hand, these forces tend to be predom- 
inantly destructive in nature, the group will show 
а more or less rapid trend toward disintegration 
and inefficiency, the strength of these forces be- 
ing revealed in the rapidity with which the group 
reaches the point of complete disintegration. 

The explanation for these phenomena is based 
upon two as sumptions. First, every group, re- 
gardless of the attributes or past experiences of 
its members, contains within it potentialities for 
both constructive and destructive group action. 
Second, the direction which these potentialities 
ultimately take is determined primarily by the ex- 
periences to which the members—as а group— 
have been subjected. Accordingly, groups in 
which constructive forces are stronger than de- 
structive ones are hypothesized to be those in 
which children have been exposed to many happy 
and constructive working experiences in a group 
situation where respect for the abilities, needs, 
and feelings of others is the rule. Groups in 
which destructive forces are paramount are hy- 
pothesized to be ones in which primary emphasis 
has been placed upon individual work and individ - 
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ual accomplishment. Children in these groups 
lack the kind of experiences in which social rela- 
tions skills may be learned, and thus fail in their 
efforts when suddenly placed in a situation which 
demands such skills. Groups in which destruc - 
tive forces show themselves in hostile behaviors 
are believed to be ones in which children’s exper- 
iences have been such as to generate marked feel- | 
ings of rivalry, jealousy, bitterness, and frustra- 
tion. Those in which destructive forces show 
themselves in rowdy, althoughnon-hostile, behav- 
iors are believed to be ones in which children have 
not been taught to handle theirfreedom wisely and 
constructively. There is no frustration, only a 
boredom which breaks into excited and welcomed 
horseplay when external controls are released. 

These and similar hypotheses are as yet quite 
tentative, as are many of the findings here report - 
ed. Muchresearchof a rigorous nature is needed 
before the test can be used with confidence and 
with accuracy. Research on reliability is now in 
progress, further use of the test hinging upon the 
outcome. 


Summary 


This paper describes asituational test useful 
for evaluating the social relations skills of ele- 
mentary school childrenina group problem-solv- 
ing situation. Data accumulated during the devel- 
opment of the instrument indicate that the obser- 
vation and scoring procedures here reported pos- 
sess satisfying reliability, but the rigorously de- 
signed study which will provide definitive answers 
to such questions has not been completed. The 
test is presented as a research technique which 
might prove valuable in studying the effects of var- 
ious kinds of educational environments upon the 
behavior of children's groups. 
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DEVELOPMENT OF AN INSTRUMENT FOR 
STUDYING VERBAL BEHAVIORS IN A SEC- 
ONDARY SCHOOL MATHEMATICS 
CLASSROOM 


E. MURIEL J. WRIGHT 
Washington University 


MUCH OF the research relating to the improve- 
ment of classroom teaching has been dependent on 
measurements carried out inthe pre-lesson 
and/or the post-lesson periods. This approach, 
valuable as it is, is indirect, and there is need 
for a more direct approach. It seems reasonable 
to suppose that direct study of the lesson by an 
observational technique would avoid certain of the 
variables encountered in the indirect approach and, 
at the same time, should possess intrinsic valid- 
ity. Simultaneous consideration of the subject 
matter taught and the methodof its development— 
two interdependent facets of the lesson—could al- 
so be achieved. 

There follows, hereafter, a description of the 
design of an instrument for direct observation of 
the verbal interaction of teacher and pupils in a 
mathematics classroom. Support for this design 
is sought in a brief empirical study of aspects of 
validity and reliability of such an instrument in- 
cluding effective ways of reporting the obse rva- 
tions. 


4, Design of the Instrument 


Categories related to objectives of mathema- 
tics teaching formed the basis of the instrument. 
Trained observers, sampling the verbal interac- 
tion of teacher and pupils, classified behaviors by 
means of the defined categories. This permitted 
a quantitative record of the emphasis of the select- 


ed objectives. 


The Categories 


Fairly general aims in the teaching of весоп- 
dary school mathematics were selected to provide 
a basis for judgment of the developing lesson. 
These were sought in the literature dealing with 
general educational objectives, in the literature 
concerned particularly with mathematical objec- 
tives, in writings of specific mathematical inter- 
est and from personal teaching experience. 

Categories for the classification of classroom 
behavior were selected from these statements of 
aims of mathematics teaching in the light of three 


criteria. Each aim was capable of careful defi- 
nition, important for general and for mathemati - 
cal education, and feasible of attainment in the 
secondary school. These categories fell natural- 
ly into three frames of reference: ability to think, 
appreciation of mathematics, and attitude of cur- 
iosity and initiative. 


Frame A. Conscious developing (teacher) or use 
(teacher or pupil) of ability to think by, 1. an- 
alyzing, 2. synthesizing, 3. specializing, 4. 
generalizing. 

Frame B. Conscious developing (teacher) or dem- 
onstration (teacher or pupil) of appreciation of 
mathematics—an evaluative approachto, 1. the 
methodology of mathematics, 2. the subject 
matter of mathematics, 3. the place of mathe- 
matics in other fields and areas, 4. the place 
of mathematics in history. 

Frame C. Conscious fostering (teacher) or dem- 
onstration (teacher or pupil) of an attitude of 
curiosity andinitiative, 1. enthusiasm for fresh 
knowledge, 2. independence of thoughtand ac- 


tion. 


The requirements of an instrument for system- 
atic observation (exclusiveness, e xh austiveness, 
and unity) had also to be met by this definition of 
categories. Logically and by practical explora- 
tory use, these frames (and within each frame the 
several categories) were deemed to be exclusive. 
Provision of a neutral category for behaviors not 
falling in the defined frames ensured exhaustive 
classification. Finally, because the aims were 
defined to prove necessary and sufficient to en- 
compass the aims of mathematics teaching, unity 
was assumed. 

Detailed definition of the positive and negative 
aspects of each frame and of the several categor- 
ies of each follows in Table I. 


The Classification of Behaviors in Terms of 
the Categories 


Frames of Reference— These aims of ability 
to think, appreciation of mathematics, and an at- 
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iosity and initiative constituted three 
а posce eng Each verbal be h avior oc- 
curring in the classroom was viewed in all three 
frames and was classified under one of the cate- 
gories of each. This is shown in Table I. The 
first particular example of behavior is: 


“(Тһе problem: to show that the roots of a 
particular equation are real and unequal) 
Pupil: ‘If the roots of 16x? - 16x - 51 = 0 
are real and unequal, then we should find 
the discriminant greater than zero.’ ” 


This was classified under Ability to Think as An- 
alyzing, positive, or A,+. In addition, this ex- 
ample occurs under Appreciation as Subject mat- 
ter, positive, or В;+, and under Attitude as Inde- 
pendence, positive, or С». When not readily 
classifiable under at leasttwo of the above frames 
a behavior was recorded as Neutral. 

Single Behavior— The behavior observed was 
the verbal responses of teacher and pupils plus 
any concurrent blackboard development, vocal in- 
flection or facial expression contributing tothe 
meaning of this verbal interaction. 

The single behavior was obtained bytime sam- 
pling; the behavior of thefirst speaker ina fifteen- 
Second interval. During each minute of observa- 
tion two behaviors were classified. Of the four 
fifteen-second intervals occurring, the first and 
third were used for observation, the second and 
fourth for classification. 

Arrington (1) indicated str ongly that the role 
of time sampling in observational techniques de- 
pends onthe ability, by its use, of obtaining more 
reliable information than would be be obtainable 
by other means. For this instrument two practi- 
cal difficulties directed its use. First, it was 
found that many naturally complete behaviors, if 
artificially divided, mightbe classified in parts— 

losing, however, their contextual meaning. The 
close following of the individual phrase essential 
in the study of the interaction in the c1 assroom 
might result readily in such Sub-classification of 
complete behaviors. Second, because judgment 
and recording inthree frames simultaneously was 
required, short periods had to be taken from the 
continuous interaction to complete this. Thus, 
omission of classification of variable amounts of 
the natural interaction would take place. Com- 
plete elimination of these difficulties would occur 
only if adequate recording devices, 1.е., sound 
films, were practical. However, the two difficul- 
ties were diminished by time sampling: first, an 
interval of sufficient length to ensure classification 
of most behaviors was chosen; second, use of al- 
ternate intervals for classification permitted as- 
sessment of the amount of interaction unobserved. 

Unit of Behavior— The single l es son provided 
a natural unit for obser vation of the classroom. 
Because of some variation inlength of period from 


one class to another, it was decided to limit ыг 
observation to forty-five minutes. А total 55, 
quency of ninety behaviors (two рег minute) | " 
thus obtained for each standard unit of obser yang 2 
The observer viewed and classified ao ae 
from a desk atthe back of the classroom using GU” 
plicated recording sheets anda stop-watch. 


The Observers 


The classification of behaviors under the ей 
fined categories was determined by the understan : 
ing, interpretation and inference of the Geen 
Care, therefore, was directed to adequate ргера а 
ation in terms of mathematical understanding Та 
of knowledge of the experimental procedures. гн 
principal observer, 1, was the investigator hol 
ing degrees in mathe matics and in education iê 
with experience in the secondary classroom. Т я 
assistant observer, 2, was а fourth-year studens 
at the University who was planning to bee 
teacher of mathematics in the secondary ا‎ 
This student was majoringin mathematics and je 
completed most of her professional cou rses, bu 
had no classroom teaching experience. өрг 

Experience and Training of the Observers Е 
server 176 understanding of the problem had been 
developing during an eight-month period of exper 
imental design. Her experience in recording was 
in the use of successively modified schedules 20 
amounted to approximately sixty clock hours. ft 

Observer 2 was trained as follows: Gener f 
discussion of the purpose, scope and methods 0 
the problem was followed by detailed study of the 
arbitrary definitions of behaviors as collected 2 
an Obervers’ Manual. Thereafter, three vis} 8 
occurred for practice in classification of behavior? 
in the classroom. Beginning with five minute PF 
iods of classification ina single frame only, at m 
third visit the full 45-minute unit was studied : 
allthree frames. Further experience was gaT a 
by repeated classification of filmed іліегасћот» 
and by observation of fourteen units in the clas 
room itself. His 
Subjects—Seeking to minimize the effect of the ° Бе 
server оп the subject, Observer 1 met all of t = 
principals and most of the teachers before the рег 
iod of observation began, presenting the propos, 
and ans wering resultant questions. Detailed i 
Scription of theobservational device was ригро o 
ly avoided. This was explained in terms of te 
possible behavioral effects and was acceP Б 
readily. On the first day of observation the ue 
Servers were introduced to the pupils as visito 
from Washington University interested in te? nts 
ing. Probably because of the custom of stude 
from the University visiting in these schools, 


à ter 
Pupils generally did not show any particular us 
est in the observers. 
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a brief outline of results obtained was sent to each 
teacher after the analysis was complete. 


The Classes 


The twelve algebraclassrooms in which the in- 
strument was studied were in secondary schools 
in Metropolitan St. Louis іп a setting of higher 
than average economic status. The classrooms 
were selected to be comparable in terms of size 
and organization of school, academic and profes- 
sional qualifications of the teachers, broad admis- 
sion requirements to high school, and heterogen- 
eous grouping as far as mathematical ability was 
concerned. 

Most of the lesson period was spent in group 
discussion conducted with the teacher at the black- 
board, the pupils at their seats. There were oc- 
casional periods of supervised study. 


Testing of Validity 


At this stage in the development of this instru- 
ment two types of validity—content and construct 
—and reliability were considered. 

Assessment of content validity included study 
of the 15-second division at alternate intervals as 
a valid sample of the natural behaviors of a unit; 
also considered was the number of units needed 
to make adequate representation of the mean be- 
haviors of a topic. 

The study of construct validity was directed to 
assessment of logical relationship between behav- 
iors revealed by this instrument and general 
teacher and pupil characteristics. 

Reliability of the observer was assessed in two 
ways—agreement between observers, and agree- 
ment between repeated classifications of the same 
filmed interaction by the single observer. 


The 15-Second Division at Alternate Intervals 
Тһе 10-86с010 ee 


Comparisons of the chosen 15-second fixed di- 
vision, and of a larger and shorter fixed division 
were made with a natural division of the interac- 
tion. It was essential that this be the first step 
in consideration of the design of the instrument 
because all observations in the classroom were to 
be made by this time sampling. 


Method 


Four ten-minute films (4) were selected ав ap- 
proximating а class period. Typescripts of the 
sound tracks of these films were prepared. Arti- 
ficial division of the typescr ipts into arbitrary 
time intervals of 74-second, 15-second, and 30- 
seconds from zero time was made. This was Т” 
lowed by а division by classifiable behaviors wit | 
out regard to time. This latter division was 
termed the ‘‘natural’’ division, and а natural be- 


havior was defined as “а continuous section of the 
defined interaction, chosen without regard to time, 
forming a behavior which, when considered in its 
context, is readily classibiable in A, B, orC, 
considered separately, ог аз М.” 

Then the behavior in each 7=-весопа interval 
was classified by observers 1 and 2, individually, 
in each of the frames A, B, and C, or alternative- 
ly as N. Classification in each of the frames or 
as N was repeated for the 15-second and the 30- 
second intervals. It proved necessary to include 
an additional category ‘‘unclassifiable’’ because of 
the occurrence of a number of intervals, particu- 
larly in the 72 -ѕесопа division with too few words 
to permit classification. 

Next, the typescripts were divided by observ- 
ers 1 and 2, individually, by natural behaviors 
classifiable in frame А or alternatively as N. This 
was repeated for frame В and for frame C, lest 
the length of the natural interval in each frame did 
not coincide. 

Points of disagreement in a classification for 
each divisionwere resolved by discussion, pro- 
viding a single joint classification by observers 1 
and 2. 


Results 


Totals of teacher and pupils behaviors were ob- 
tained for the four films takenasanentity. -These 
totals for the 74-second, 15-second, the 30-sec - 
ond and the natural divisions were 324, 163, 82, 
and 175, respectively. 

Ratios of total teacher- pupils behaviors for 
these same intervals were 2.59, 2.60, 2.77, and 
2.02, respectively. In each fixed division, an en- 
larged teacher effect resulted. 

Further consideration of the та -second or the 
30-second divisions was not warranted after not- 
ing wide disparity from the natural values in their 
total number of responses, the teacher-pupil ratio, 
and the lengths of the intervals. 

A frequency distribution of the lengths of the 
natural division is illustrated in Figure 1. These 
lengths coincided in Ше classification of behaviors 
in each of the three frames. 

Using Ше ‘‘t’’ test of significance of a single 
mean (3), probability levels of non-significant dif- 
ferences between totals of the 15-second division 
and of the natural division were obtained (5) and 
recorded in Table II. It was noted that in no frame 
was a Significant difference apparent (significance 
level, p = 0.05). 

Probability levels of the non-significant differ- 
ences between totals of 15-second even and odd 
sets of intervals were recorded in Table IH. It 
was noted that none of the frames showed signifi- 
cant differences at the significance level p = 0.05. 
Indeed, of the twelve comparisons made, only two 
probability levels less than 0. 60 occurred. 

The use of this *'t" test necessitated the as- 
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TABLE II 


PROBABILITY LEVELS BETWEEN 15-SECOND AND 
NATURAL DIVISIONS* 


Measure Teacher Pupil Class 
ee ee 
РА 0.82 0.10 0.39 
Pp 0.35 0. 20 0.45 
Ре 0.32 0.06 0.18 
^ PIN.) 0.33 0.70 0.75 

с 

а 0.05 1.00 0.15 


T Teacher, pupil and class behaviors in Ability to Think, A, Appre- 
ciation, B, and Attitude, C. Symbols for Neutral and Unclassifi- 
able М, U. For А and B, n = 7; for C, n= 3; for М,Џ, n- 1. Use 
of “t” test of significance of a single mean. Significance level 
р = 0.05. Where p> 0.05, difference is not significant. Where 
p «0.05, difference is significant. 


TABLE III 


PROBABILITY LEVELS OF SIGNIFIC ANT DIFFERENCES BETWEEN 
THE ODD AND EVEN SETS OF INTERVALS OF THE 
15-SECOND DIVISION* 


Measure Teacher Pupil Class 
0. 60 0.85 0. 60 
РА 
р 0.09 0.85 0.13 
В 
0.75 0.86 0.75 
Рс : 
1.00 1.00 1.00 


Р 


(N, 0) 
in Ability to Think, А, Apprecia- 


Y viors 
* Teacher, pupil and "а pee for Neutral and Unclassifiable N, 
bo E 3; for М, U, n- 1. Use of “t”? test 


т п 
U. For Aand B, п= 7; for C, Significance level p = 0.05. Where 


i an. 
of significance ofa single f ignificant- Where p <0.05, difference is 


p > 0.05, difference is not 
significant. 
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sumption of the continuity of the variable formed 
by the differences between the frequencies of cor- 
responding categories in the fixed and in the nat- 
ural divisions. Later this assumption was avoid- 
ed by the study of each category separately. 


Discussion and Inferences 
DISCUSSION эне Se 


The 7+ -second division and the 30-second di- 
vision were inappropriate as approximations to 
the natural division because of wide discrepancies 
apparent in total numbers of responses, teacher- 
pupil ratios of behaviors and lengths of intervals. 

The 15-second division and the natural division 
compared closely in terms of total numbers of re- 
sponses and lengths of intervals, while the ratio 
of teacher-pupil behaviors still indicated an en- 
larged teacher effect. However, no significant 
differences were shown to occur between corres- 
ponding teacher, pupils and class behaviors in 
frames A, B, or C, or Neutral and Unclassifiable. 
Further, no significant differences occurred be- 
tween classifications of the even and odd alternate 
sets of 15-second intervals. 

Recognizing the limitations of the use of any 
fixed division in approximating the observation of 
natural behaviors and in particular the enlarged 
teacher effect present in its teacher-pupils ratio, 
the 15-second division at alternate intervals was 
shown empirically to be a valid sample of the to- 
tality of behaviors of a unit. 

Such comparison of the classifications of a 
fixed divisionanda naturaldivision seems unusual 
in studies of validity of time sampling. For im- 
mediate description the criterion has been that of 
reliability. For conclusions on the general situ- 
ation the accepted criterion has been that of the 
internal consistency of the data determined by 
comparison of different samples taken in the same 
manner in the same situation. Itis noted that this 
assumes that variation other than that resulting 
from randomization does not occur between suc- 
cessive samples. In the classroom, while a gen- 
eral pattern typical of a teacher and his pupils 
may well emerge, some variation between succes- 
sive lessons will be intentional or may result from 
Such other factors as change in content. 


Number of Units Necessary to Represent 
equately the Mean Behavior 
of a Topic 


Observation of classroom interaction using the 
15-second samplinginterval was then begun. It 
was necessary to determine empirically the min- 
imum number of units of behavior which would 
provide representative estimates of mean behav- 
iors of a topic. 


Method 


Three classrooms studying the same topic in 
“Algebra 3” were visited by Observer 1. Classi- 
fication of all units of behavior in that topic Was 
carried out. The results were recorded as Class- 
rooms I, II, and III. 

Total of behaviors of each teacher and group of 
pupils were calculated for each category. For 
convenient comparison, these raw score frequen- 
cies were then adjusted proportionately in order 
that a standard total number of 100 responses was 
obtained for each class period. В 

Cumulative arithmetic averages, xj, of adjust- 
ed frequencies of behaviors werecalculated begin- 
ning with unit 2. Unit 1 was not considered in 
these because it was felt likely that the first les- 
son of a topic might vary markedly from the les- 
sons of the succeeding days. The means for the 
complete topic, including unit 1, were defined as 
the theoretically best estimates for each category 
for the topic: Xi. 

Frequencies of individual categories were test- 
ed for the significance of their differences from 
the theoretical frequencies by considering the 
probability of occurrence of such differences 1n 
the binomial distribution. 

Present in the use of this analysis was the as- 
sumption of independence of xj and xj. This as- 
sumption would prove justifiable in practice if the 
theoretical value were the mean of a very large 
number of units of behavior. The errors entailed 
by the assumption in this problem became quite 
marked as the number of units of behavior includ- 
ed in the cumulative averages increased towards 
the total number of units observed. It was felt, 
however, that consideration of individual categor- 
ies by the binomial distribution included fewer as- 
sumptions than would Ше “Ч?” tests as used above, 
or the X test, which was possible in numerous in- 
vestigations throughout the literature. In addition 
to the assumption present in this use of the bino- 
тігі distribution, the use of X? inthe present stage 
of analysis wouldinclude an assumption of inde- 
pendence of the several categories. While their 
exclusiveness has been argued logically and im- 
posed by the method of classification, it was felt 
that empirical justificationof an assumption of in- 
dependence should precede use of х2, 


Results 


Probability levels of non-significant differences 
between successive cumulative averages and the 
theoretical averages were obtained for each cate- 
gory. A general gradual increase in the levels of 
probability occurredas the number of units includ- 
ed in the cumulative averages increased. In сег” 
tain instances, this rise was discontinuous. In nO 
category nor inany classroom were consistent 
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sharp increases in the levels of probabilities ob- 
served. 

Investigation followed of the differences be- 
tween the four-day cumulative average (largest 
obtainable from all three classes) and the theoret- 
ical average in all three classes. For this aver- 
age of four units the numbers of categories not 
significantly different from the theoretical values 
was noted for probability levels: p= 0.70, р = 0.80, 
and р = 0.90. These were approximately 4/5, 3/5 
and 2/5, respectively. 

Limited assessment followed of the consistency 
of the results of asingleday with the mean scores. 
At a probability level of p = 0.05, less than 1/10 
of the scores of individual categories for the 
single unit differed significantly from the mean 
category score. 


Discussion and Inferences 


In seeking a practical limit to the number of 
days necessary for observationofa classroom, it 
has been decided that the totality of behaviors in 
a topic would be assumed to represent all behav- 
iors occurring in the classroom. Cumulative av- 
erages of the units of a topic were then consid- 
ered, in an attempt to determine a small number 


of units that could be used to approximate the topic. 


It was considered that if such a small number of 
units existed within a topic, that a sharp rise in 
the probability of non-significant differences 
should occur throughout all the categories and in 
each experiment onaparticularday. Such a sharp 
rise did not occur. The steady though occasion- 
ally discontinuous rise which was met could well 
be accounted for by the increasing degree of de- 
pendency between cumulative averages and theo- 
retical values as numbers of units increased. It 
was felt, therefore, thatthe small number of units 
providing the theoretical values of the categories 
prevented an answer to this problem. A profitable 
study might well result from consideration of the 
same problem using observations collected over 
a very large number of units, eliminating the as- 
sumption concerning the topic behaviors equalling 
the totality of all classroom behaviors, and min- 
imizing the dependence of cumulative averages on 
the mean of all units. . 

It was necessary, practically, for the remain- 
ing observations inthis series to limit the number 
of units of observation in each classroom. It was 
decided to visit classrooms during four units of 
behavior, and presuming that the first and last 
days of a topic might well be aty pical, to collect 
data only on intervening days of topic. 


The Level of Agreement of Observers in Ap- 


Не inne nr Class. 
plying the Arbitrary Definitions of Class- 


ification of Behaviors 
E E ES 


As a prerequisite to validity, it was also nec- 
essary to assess thearbitrary definitions of 


classification of behaviors in the light of observ- 
er agreement. 

The test of observer reliability was twofold: 
first, the level of agreement in classification 
achieved between observers in single units of class- 
room behaviors, and second, the level of agree- 
ment in classification reached between repeated 
observations by the same observer of a single unit 
of behavior. 


Method 


Two observers took part in the complete set of 
observations. Fourteen class periods were clas- 
sified by both Observers 1 and 2. In addition each 
observer made repeated classification of a single 
recorded unit of behavior at three weekly intervals 
throughout the experimental period. 


Results 


Level of Agreement in Classification Between 
Observers—Scattergrams of the corresponding 
scores of Observers 1 and 2 in all categories of 
teacher, pupil and class behaviors for units of ob- 
servation 1, 7 and 14 are shown in Figure 2. Ву 
day 14, the results indicated a marked rise in ob- 
server agreement over those for units 1 and 7, 
and a high absolute achievement of ag r e ement in 
the close clustering of points about the line of per- 
fect agreement. 

The probability levels of individual category 
frequencies of behaviors classified by Observer 2 
being different from those as classified by Observ- 
er 1 were recorded. Asummary of the number of 
differences non-significant at the probability level 
р = 0.70, was illustrated graphically on alinear 
Scale in Figure 3. 

After the first four observations a marked trend 
towards increasing agreement was noted. 

Level of Agreement in Classification Between 
Repeated Observations of a Single Unit by the Same 
Observer— Observed frequencies and deviations of 
these from ''true" scores in each category were 
recorded for Observer land for Observer 2. П- 
lustration was made of deviation from true values 
in Figures 4 and 5 for Observers 1 and 2 respec- 
tively. The horizontal axis represented zero de- 
viation from the theoretical frequency. 

These figures were used to compare observed 
with theoretical frequencies; successive observa- 
tions by the same observer with eachother; teach- 
er, pupils and class patterns; and finally, the 
classifications of the two observers. 


Discussion and Inferences 


Realization of the lack of a well-de v eloped 
methodology of observational techniques cited by 
Heyns and Lippitt (7) continued togrow in seeking 
to assess observer agreement. 

Use of means of observer scores in statistical 
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sment of between observer agreement was 
EE HEGE by such investigators as Bales (2) 
and Steinzor (8) without indication of the depend- 
ence of this mean on the individual observer 
scores compared with it. In addition, the level of 
experience of the two observers was not compar- 
able, preventing use of the technique suggested by 
Guetzkow (6). Differences between the scores of 
Observers 1 and 2 were consequently studied by 
considering the probability of Observer 2’s scores 
being different from theoretical (Observer 178) 
scores, by means of the binomial distribution. 

In the area of level of agr eement between re- 
peated observations by the same obs erverofa 
Single unit of behavior, the probability of differ- 
ences occurring between observed and theoretical 
results also was studied by means of the binomial 
distribution. However, the nature of the Scores 
themselves induced a further difficulty. Where 
frequencies in individual categories were low, and 
in particular where the theoretical frequency of a 
category equalled zero, the resultant 0.00 prob- 
ability presented a markedly distorted picture. 
Therefore, it was dec ided to indicate deviation 
from the theoretical results atthis stage by abso- 


lute values, and to illustrate this ona linear scale 
in Figures 4 and 5, 


Level of Agreement in Classification Between 
Observers— Agreement between the two observers 
increased rapidly during the 14 units of observa- 
tion until 50/60 categories were not significantly 
different at a probability level p = 0.70, Unfor- 
tunately, no evidence of use of the binomial ex- 
pansion in a similar type of test was found in the 
literature to enable direct comparison with other 
Studies. However, thelevelof agreement reached 
Seemed to be entering a favorable level. 

Because only two observers were compared, 
no conclusion may be drawn beyond the immedi- 
ate situation. However, the results would infer 


) was fairly 
d from the 


eviations in 
successive trialsthat some increased Skill with 


additional experience was gained, Similar con- 
siderations follow from the two trials of Obsery- 
er 2. M 

Two categories showed marked deviation from 
the theoretical values, despite consistent scores 
in successive trials and between observers; the 


positive andnegative aspects of cu riosity. This 
effect had not been unexpected. Discussion of in- 
terpretation of behaviors in this frame of attitude 
had brought out the im portance of inflection of 
voice, and the effect of overall lesson develop- 
ment. It had been considered that these two fac- 
tors might wellbe hiddenor minimized in theclose 
examination of a typescript as used in this part of 
the study. This emphasized the importance of di- 
rect classroom observation. 


Magnitudes and Variability of Mean Behaviors 
of Teachers an Pupils 
ets апа Pupils 


These considerations ofthe validity of the sam- 
pling interval, the minimum number of units to be 
observed, and thelevel of agreement of the observ- 
ers necessarily preceded interpretation of results. 

Three main questions regarding classroom be- 
havior observed were raised. First, were the 
mean behaviors in Algebra 1 classes significantly 
different from those of Algebra 3? Second, how 
Should the magnitudes and var iability of the se- 
lected aims be combined in describing the class- 
room? Finally, were known characteristics of 
teacher or pupils logically related to significant 


differences of behavior геу ealed by the instru- 
ment? 


Method 


Materials Used—Interaction in six classrooms 
of each of Algebra 1 and Algebra 3 was classified 
by Observer 1 for four units of behavior. 

The “Ч” test of Significance of the difference 
between means was applied to corresponding be- 
haviors in Algebra 1 and 3 (р = 0.05) (ref. 3). 

Scores of individual categories of teacher and 
pupils were illustrated in two wyas: in absolute 
value on a bar g 
uses, by a profile comparison. In pr eparing the 
profiles, it was noted that scores of negative cat- 
egories were inherently negative. Thus the mean 


: utral category was considered 
negative, However, since it Seemed likely that 
some neutral behaviors were necessary, caution 


was marked in inte Tp reting values falling far tO 
the right as indicatin, а i 


S that might logically be expected from 
istics. This is example 
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evidenced in class by an increased number of teach- 
er behaviors. Inparticular, actual planning of the 
problems to be used in the lesson period itself 
would include a greater need for teacher demon- 
stration of analyzing and/or synthesizing rather 
encouragement of this inthe pupils. It was also 
presumed that encouragement of initiative would 
be low to avoid revelation of inadequate prepara- 
tion. the scores in pupil initiative might be ex- 
pected to indicate whether this lack of teacher 
preparation was a common experience: relatively 
high Со. would be likely to endorse this point of 
view. In addition, it was expected that pupil in- 
teraction would tend to contribute to the solution 
rather than the understanding of the problem—thus 
an emphasis in pupil behavior of analyzing and/or 
Synthesizing and a decrease in Specializing. 
Therefore, comparison of the results of Class XI 
with means of Algebra 1 was made in the follow- 
ing areas: 


1. Totals of teacher and pupil behaviors 

2. Teacher behaviors іп Аз +, A34, Ca., and 
C2- 

3. Pupil behaviors in A24, Аз+, Са», and C3- 


Results 


Means and standard deviations of raw scores 
forming representative behaviors were obtained 
for eachclassroom. From these means and stand- 
ard deviations for the Algebra 1 and the Algebra 
3 classrooms were calculated. Using Ше ‘‘t’’ test 
of significance of difference between means, and 
a probability level of 0. 05, only two of the twenty- 
One categories of teacher behaviors were found to 
be different. These were positive e — 
positive synthesizing. In each case the mean fre- 
quency of the teachers in Algebra 1 was зева үп 
the mean frequency of the teachers in Ае — 
One category of pupil behavior was — M 
was negative curiosity, | хав. mean Alge 
was greater than mean Algebra ә. : 

A bar graph illustrating the mean Lus a 
teachers and pupils in Algebra 1 is shown кл 
ure 6. Most of the behaviors in each жеге г ved 
ed to the three frames of reference, only тата 
in one hundred proving neutral in these ха 
Consideration of the number of behaviors cope ai 
ed for teacher and pupils separately emphas 


i d in class inter- 
the major part the teacher playe proximately 


action. Despite one teacher е ег behav- 

25 pupils making up a class, five Eos am 

iors occurred to every two pupil comparison 
Using the profile of Class XI for 


i i were 
with mean values, the following АША pupil behav- 
tere ОЛ jem теч = ir difference 
iors of Class XI were tested for Lote using >. 
from mean behaviors of six у org. This was 


Р + 2 
Corrected for continuity, * 


less than that of ^? atthechosen Significance lev- 
el, viz., 3.84, so goodness of fit was established. 
Thus the hypothesis that the number of teacher 
behaviors would be sign ificantly less than mean 
teacher behaviors was not Supported. (2) From 
the profile comparison, Figure 7, it was noted 
that teacher f reque ncy in positive Synthesizing, 
Аз, Was high, teacher Írequency in positive Spec- 
ializing, Аз, was low, teacher frequency in pos- 
itive independence, C;,, was low, and in negative 
independence, C;., was average. Thefirst three 
relationships with the mean were thus shown to 
agree with the earlier hypothesis. (3) From the 
profile comparison, Figure 7, it was noted that 
pupil frequency in positive synthesizing, A;,, was 
high, pupil frequency in positive Specializing, Аз. 
was low, pupil frequency in positive independ- 
ence, С, was low, and in negative independence, 
C;., was low. The firstthree relationships agree 
with the hypothesis. The point of view that the 
type of lessons observed occurred regularly was 
not supported. 


Discussion and Inferences 


Generally, mean behaviors of teacher, pupils 
or class in Algebra 1 were not different to those 
shown in Algebra 3. It may be inferred that the 
two samples studied were drawn from the same 
population. Thus in this instance, differences in 
Specific subject matter or in age of pupils did not 
affect significantly the pattern of behaviors. This 
was not surprising in terms of subject matter for 
the courses represented two parts of the overall 
algebra program for the state with differences in 
content rather than in difficulty of subject matter. 
It was possible, however, that the differing ages 
of the pupils in the two samples, and the probable 
selection of pupils preceding the course in Algebra 
3 might have resulted in general significant dif- 
ferences in pattern. Later study, then, might be 
carried out within the secondary school algebra 
program with less regard for the formal divisions 
occurring in the schools. 

The order of emphasis of categories in the aim 
ability to think wasas follows: specializing, syn- 
thesizing and generalizing (approximately equal), 
analyzing. It was felt that the order of the first 
three was reasonable, but that ће small incidence 
of behaviors in analyzing was undesirable in the 
light of the widespread usefulness of this mode of 
thought. The order of emphasis in the aim of ap- 
preciation was as follows: methodology and Sub- 
ject matter (approximately equal), other fields 
and areas, historical development (no behaviors). 
It was feltthat emphasis of the first two was most 
desirable in mastery of a body of knowledge, but 
that greater emphasis inthelatter two would con- 
tribute to the pupils’ enthusiasm and their under- 
standing. 

The order of emphasis inattitude was negative 
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itive independence (approximately equal), 
ane ды (п rem igible). The low incidence of 
curiosity was disturbing in considering the poten- 
tial force for leaning unexploited. Greater em- 
phasis of positive independence was also felt de- 
sirable, although as the negative aspect included 
an absence of encouragement as well as active 
discouragement uncertainty as to the level to be 
attained was expressed. 

Examination of the profile comparison in light 
of known characteristics and expected differences 
supported the validity of the instrument, for gen- 
erally, expected differences did occur. 

Development of these two aspects of examina- 
tion of scores—in absolute value and in profile 
comparison— would seem promising in seeking a 


means of evaluating the performance with this 
instrument. 


Summary 


The design of an instrumentfor direct Study of 
the secondary school mathematics classroom is 
described. This design required the selection of 
criteria for judgment in secondary school mathe- 
matics; the development of an obs ervational 
schedule based on these criteria; and the empiri- 
cal assessment of the instrument in the class- 
room. 

As criteria for judgment, careful definition 
was made of certain aims of mathematics teach- 
ing. These aims were defined in three frames of 
reference each of several categories: 


A—Ability to think, analyzing, 
Specializing, generalizing; 
B—Appreciation of mathematics, methodology, 


Subject matter, other fields and areas, his- 


torical significance; 
Сы 


Attitude of curiosity and initiative, enthusi- 
asm for fresh knowledge, and independence. 


Synthesizing, 


Alternate inter- 


) BEEN g being carried out in the 
intervening intervals. The period of observation 
was forty-five minutes in length, resulting in the 
classification of ninety behaviors. Each class- 
room was visitedfor four such Periods, the mean 
behaviors resulting in “ d 


representative” av- 

iors for that classroom, ` bea 
The observers were the investigator. ; 

; SUB а 

fourth-year student, a poten Pooky 


tial teacher of math- 
ematics. It was essential that the observer be 
skilled in subject matter as wel] as in the tech- 


niques of the observation schedule, 
Empirical assessment in algebra classrooms 


followed this period of design of the instrument. 
Analysis of the results permitted the following 
inferences: | 

Ву an unusual direct comparison of behaviors 
yielded by time sampling and by a defined natural 
division of the verbal interaction, the choice of 
the 15-second interval was supported. In addition, 
a limited consideration of the consistency from 
day to day of observations by this sampling ішег- 
val showed a reasonable level of agreement. Fi- 
nally, the facility of its use in the classroom wae 
so marked, that the validity of this sampling in- 
terval was felt to be established. 

The second sampling procedure used was that 
of observing eachclassroom for a limited number 
of days. No conclusive answer to this problem 
was obtained. Practically, it was decided that for 
the rest of the present study the sam ple should 
consist of four days of observation of each class- 
room, 

А high level of consistency in interpretation of 
behaviors was shown by the principal observer, 
and to a lesser degree by the assistant observer 
who was still gaining observational experience. 
These findings supported the classification of be- 
haviors determined. 

The use of bar graphs of absolute values of 
teacher and pupil behaviors, and profile compar- 
isons of individual with mean values permitted де“ 
tailed description of actual emphasis of aims iD 
the classroom and comparison of individual class” 
rooms with mean values, Consideration of cer- 
tain known char acteristics of teachers or pupils 
Which were in agreement with differences shown 
by the instrument supported the validity of the de- 
Scriptions yielded. Such profile comparison may 


well prove the basis for development of the in” 
strument for evaluative purposes. 


General Appraisal of the Instrument 
права. of the Instrument. 


Because the basis of judgment for this instru- 


ment is that of aims of mathematics teaching, the 
instrument is theoretically limited to use in math- 
ematics classrooms. However, the surprisingly 
general nature of the statement of aims might well 
bear consideration for application by specialists 
in other subject matter fields to their own situa- 
tion. If such application was successful, general 
systematic comparison of teacher or pupil реһау- 
iors, independent of subject matter yet including 
in the observer interpretation the quality of the 
subject matter field, could be attained. 

It is noted that the observers must be exper- 
ienced in the subject matter field as well as traine 
in the particular observational procedures. 

The behaviors classified with this instrument 
are those of verbal interaction or its attendant 
demonstration. Thus the description of the 1550 
Period yielded by the instrument is limited to the 
amount of verbal class interaction occurring. 1 
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classrooms where long periods of supervised 
study are regularly used, this limitation may be 
so great that a picture of the in-lesson period by 
this instrument would be impossible. 

The recording of class interaction in the pres- 
ent study has been dividedinto teacher scores and 
pupil scores. The pupils’ scores are recordedas 
a group, thus no indication is available of individ- 
ual pupil performance. In addition, discussion 
between teacher and individual pupils outside of 
the class interaction period—for ins tance while 
some of the class is studying—has not been con- 
sidered. Thus a possible area of great teacher 
and pupil variation is untapped by the present ob- 
servational procedure. 

Currently another limitation exists in the need 
for preparation of the observer by the investiga- 
tor. This will be necessary until further study of 
the training procedures can be carried out. 
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THE SEMANTIC METHOD FOR TESTING VOCABULARY 


MILDRED BERWICK 
Boston College 


Chestnut Hill, Massachusetts 


Purpose of this Study 


THE BASIC objective of this study was the de- 
velopment of vocabulary tests that ade quately 
measure knowledge of mul ti-meaning words by 
fourth-grade pupils through eighth grade. It was 


believed that this objective could be reached 

most effectively by: 

1. Constructing vocabulary tests involving se- 
lections of meaning in relation to a pupil’s 
interpretation of a word symbol. 

2, Measuring more meanings per word than 
previous vocabulary tests. To illustrate 
the single meaning test, anitem taken from 
the Metropolitan Tests, Elementary Read 
ing Test, Form В, is shown below: 


9. sack seat hold bag lift box 
The Multi-Meaning Vocabulary Tests, inthe item 


for the word “баск,” include several meanings 
of the word. To illustrate: 


(x) a large bag 
() to hold 
(x) a loose coat 
sadi: (x) to discharge 
() some ashes 
(x) to plunder 
(x) a white wine 
( ) to hunt 


3. Measuring more meanings in approximately 
the same time limits. Individuals were able 
to designate all the meanings of a word in 
about the same time limits as they would 
need for finding the Single meaning response. 

4. Determining grade levels of meaning for 
each multi-meaning word used in the test 
for grades four through grade eight. No 
claim is made that this is a classification 
of multi-meaning words according to grade 
levels, but pupils' performance on each 
item demonstrated their proficiency or fail- 


ure in identifying the meaning at each grade 
level included in the test. 


Development of the Test 


Method of Selecting Words for the Test 
res tor the Test 


The Teacher’s Word Book of 30, 000 words was 
used as a source for securing a basic vocabulary 
list. The words in this book are not classified ac- 
ich they are intended, 
tical order and coded 


nd ending with the 
d according to grade 
rds obtained for each 
and also the total num- 


levels. The number of wo 
grade is shown in Figure 1 
ber of words. 

It will be noted that some lists were combined, 
having one list for two grades while other grades 
had their own lists. This grouping was used inthe 
Teacher’s Word Book of 30, 000 words. 

Since it was desired that the final selection of 
words for the test was the best sampling of multi- 
meaning words possible, several types of discard- 
ing were done and criteria to be met were estab- 


lished from the large initial list of 8008 words, 
The first discarding was: 


с first discard is shown in 
Figure 2. The results of the first discard reduced 
the initial list to 6504. 


The next discarding was: 


2. To discard all Single meaning words. 
of the 6504 words was 


abridged Edition o International 
Dictionary, and if only one meaning was listed for 
a word the list, 

5 that the list was 
ага. 
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FIGURE 1 
NUMBEF OF WORDS LISTED FOR EACH GRADE FROM THE 


THORNDIKE BASIC VOCABULARY LIST FOR GRADES 1-8 


Total Number of Words 


8,008 
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FIGURE 2 
NUMBER OF WORDS LISTED FOR EACH ЗНАТЕ AFTER CERTAIN PARTS 
ОР SPEECH МЕКЕ ELIMINATED 
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FIGURE 3 
NUMBER OF WORDS LISTED FOR EACH GRADE AFTER 


THE SINGIE MEANING WORDS WERE ELIMINATED 


Total Number of Words 
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FIGURE 4 
NUMBER OF WORDS LISTED FOR EACH GRADE AFTER 
FIVE OR IESS MEANINGS WERE ELIMINATED 
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main on the list. This was reasonable because 
the majority of words have connotations and nu- 
ances of meaning that are not concepts and could 
not be accepted. 


The third discarding was: 


3. All words having five meanings or 1658, 
All of the word lists for each grade were reduced 
considerably. Little difference exists between 
the total number of words per grade. 

Each word now on the list was looked upin the 
Thorndike-Lorge Semantic Count of English Words. 
This study appears only in mimeographed form 
and gives the relative frequency of occurrence of 
each meaning per mille. 

А premise was set up that а meaning to be used 
in the test had to be atthe .050level. This means 
that of 1000 occurrences of the word, a meaning 
was used 50 times, Many meanings were elimin- 
ated, but no word was dis carded through this 
Screening because a mul ti-meaning word would 
have at least one common meaning above the .050 


level to be classified in Thorndike's Basic Vocab- 
ulary List. TET 


The fourth discarding was: 


4. Words like do, make, like, come, et cetera, 
having one over-worked meaning, and the others 
used infrequently, were discarded. Nearly 50 per- 
cent of the words were discarded. 

There were still too many words to use in a 
preliminary test, because each key word selected 
would test on the average of five meanings, which 
would mean, if no further discarding was done, 
that over 7000 meanings would be tested. It would 
be almost impossible to secure time enough for 


children in a public school to do such a lengthy 
project. 


The last discarding was: 


5. To tabulate the fre 
each meaning of the 157 
А frequency average pe 
for each multi-meaning word. 

The final list consist 
words and 1984 meanin 
1 and 2 had the highest 
words after they were ave raged for highest Íre- 
quency of occurrence of meaning. Many simple 


words have high levels of meaning; therefore, ће 
final selection was justifiable. 


Design of Test 


Since these tests were developed on the princi- 
ple of measuring multi-meaning Words, the first 
essential was to find out if pupils recognized 
words that had more than one meaning. 


Sub-Test I was labeled a Multi-Meaning Word 
Recognition Test. A sample item is shown below: 
lettuce moonlight paddle squash port 

The pupil underlined the words which have 
more than one meaning. The preliminary test con- 
sisted of four units, 50 items to a unit with five 
words in each item and a total of 803 multi-mean- 
ings were presented to each pupil for recognition. 
An item analysis was done on this test, and the 
final tests consisted of three forms, 40 items to 
aform. A total of 484 multi-meaning words was 
tested. 

These vocabulary tests, exceptfor the recogni- 
tion test, purport to measure more meanings per 
word than other vocabulary tests; therefore: 

Sub- Test II was a Multi-Meaning Identification 
Test. A multi-meaning word was listed along 
with its various common meanings according to 
highest frequency of occurrence of meaning. In 
the preliminary test each multi-meaning word was 


listed with ten alternatives, such as items like 
these: 


machine for weighing (x) 
to climb up (x) 
to make a drawing (x) 
according to a gradu- 

Scale ated table () 
series of tones (x) 
a yardstick () 
а system of numbering (x) 
a barometer () 


A total of 1984 meanings was measured in this pre- 
liminary Sub-test, 


After the refini 


items and seven choices for each key word. The 
total number of meanings tested in the three forms 
was 690. 


vocabulary testing 
plays in a true con- 


paragraphs were writ- 


y test, and 294 meanings 
were tested, for instance: 


The stagecoach stopped before the old( ) lodge. 
The passengers were glad to see the lights burn- 
i ( ) lodge for the night, One 
€ would ( )lodgea complaint 
delayed on his journey. 


1. to find fault 2, airport 3. rather angry 


4. to take rest 5, tobe weary 6. an inn 
7. not to be disturbed 


Sixty-one paragraphs жеге obtained for the 
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FIGURE 5 
NUMBER OF WORDS LISTED FOR EACH ОВАТЕ AFTER WORDS HAVING ONLY 


ONE MEANING OF HIGH FREQUENCY OF OCCURRENCE WERE ELIMINATED 


Total Number of Words 
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FIGURE 6 
NUMBER OF WORDS LISTED FOR EACH GRADE 
AFTER THE FINAL DISCARDING WAS DONE 
Total Number of Words 
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three forms of the final test after the item anal- 
ysis was done, and 183 meanings were measured. 


Trial Administration 


Following completion of preliminary editing, 
trial administrations of the items were conducted. 
The purpose of these try-outs was two-fold. The 
first objective was to determine the difficulty of 
eachitem for the grades for which the tests were 
intended. This information would be useful in ar- 
riving at a final list of words constituting a test 
with a difficulty level appropriate for each indi- 
vidual taking these tests. The second purpose 
was to identify the ineffective items, as such 
items would have no value and would not be includ- 
ed in the final test. 

In order to determine the difficul ty of each 
item and to identify the ineffective items, trial 
administration of the tests was given to a sample 
population of 491 pupils in grades 4, 5, 6, 7 and 
8 in Dover and Nashua, New Hampshire. 

The pages in each test were put in different 
chronological order so that the first items would 


not always be first, and the last items would not 
always be last each time. 


The tests were given in tw 
sittings of fifteen minutes еа 
utive days, one test in the 
one in the afternoon session. 

Keys were prepared for each test. All of the 
tests were hand-scored by persons who had had 
much experience in correctings. 


enty-two different 
ch for eleven consec- 
morning session and 


Item Analysis 


Data showing the distribution of the items ac- 
cording to difficulty level were tabulated. 

A frequency count was made on each of the 
items in the Recognition, Identification, and Con- 
test Tests. This was done on the results of the 
preliminary tests for grades 5, 6 and 7, because 
it seems that the average would be more consist- 


ent in the middle grades for which the test was de- 
Signed. Items w 


ponse. 
The item analysis included: 


1. Samples of Recognition Item 
and Rejected 


Table I represents sample words illustrative 


S Accepted 


of the type of words accepted for the Recognition 
Test. 

А number of words lacked discriminating 
power, and the average levels of difficulty were 
either too easy or too hard and were rejected. 
Samples of these words are listed in Table II. 

It was found that from the initial 803 multi-mean- 
ing words used in the try-out recognition test, 484 
fulfilled the requirements on the difficultylevel, as 
the average centered on the 50 percent difficulty and 
easelevel. The discriminating level of. 01 also es- 
tablished that there was only one chance ina hundred 
the pupil could guess the correct response on these 
words. They alsoshowed agradual increase from 
grade to grade, indicating the higher the grade, the 
more success the pupils had on the words. 


The next item analysis included: 


2. Samples of Identification Test Items 

Accepted and Rejected 

To illustrate the item analysis process on 321 
key words and 1984 meanings, sample items ac- 
cepted for the Identification Test are shown in 
Table III. 

Items like the above with gradual levels of dif- 
ficulty for most of the meanings and good discrim- 
inating powers were r etained in the final forms. 
Frequently a meaning fulfilled the requirements 
except that success at different grade levels var- 
ied in range, or there were small minus differ- 
ences at higher grade levels. Itc ould hardly be 
€xpected that responses on so many meanings 
would always show Comparable increases; there- 
fore, meanings were accepted if they showed good 
discriminating power, and the majority of mean- 
ings for the key word had gradual increases of suc- 
Cess, and the average difficulty and ease level 
centered on 50 percent, 

. Most of the words eliminated in the identifica- 
tion test showed poor discriminating power. Sam- 


зи ps Showing this factor are presented in Ta- 
е Гү, 


3. Samples of Paragraph Test Items 
Accepted or Rejected 


An item analysis determined the paragraphs 
that were desirable from a standpoint of level of 
ease and difficulty and good discriminating power. 

A sample paragraph is presented to show the 


design of this test, and the number of meanings 
presented for identification: 


When the vill 
of the fisherm 
the men. 


age bell rang out the wives 
€n rushed to the shore to greet 
Their faces would () beam as 
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TABLE I 


SAMPLE OF RECOGNITION TEST WORDS ACCEP 
TED ON THE BASIS OF 
DISCRIMINATING POWER AND GRADUAL EASE OF DIFFICULTY 


Grades 
5 6 7 à 5 

уегаре Регсепї Discrimin- 
Word % % % of Difficulty ating Level 
Pitch 37 52 65 51 101 
Lodge 45 52 58 52 . 01 
Plain 36 41 49 42 101 
Саві 30 39 59 43 . 01 
Раск 33 48 60 47 . 01 
Вопа 40 47 56 48 „01 
Trail 42 54 68 55 .01 
Арреа1 27 40 65 44 „01 
Prime 29 45 53 42 . 01 
Bulb 31 51 56 47 .01 


м 


ТАВЬЕ П 


SAMPLES OF RECOGNITION TEST WORDS REJECTED ON THE BASIS OF 
DISCRIMINATING LEVEL AND DIFFICULTY AND EASE LEVEL 


Grades 

5 6 Т Average Percent Discrimin- 
Word % % % of Difficulty ating Level 
واد‎ RENE лы ы ee 

44 89 82 12 .23 
ne 28 36 35 33 111 
Quail 17 27 32 25 511 
Wax 16 41 37 37 .23 
Нагр 33 56 10 53 39 
Е!дег 11 41 42 31 .23 
Ham 58 78 82 | у Ч 
Organ 12 26 48 Ec 
Yoke 15 37 32 28 i 11 
Ваггїег 11 21 42 25 а 
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TABLE Ш 


SAMPLES OF IDENTIFICATION TEST ITEMS ACCEPTED ON THE BASIS OF 
DISC RIMINATING POWER AND GRADUAL EASE OF DIFFICULTY 


— M ——— MÀ 


Grades 
9 6 7 қ uet 
l—«—— HÉ Д Average Percent  Discrimin- 
Key Word (Palm) % " % of Difficulty ating Level 
Meanings Tested 
Tropical tree 61 70 72 68 . 01 
Emblem of victory 24 33 44 34 ‚01 
Flat part of hand 57 68 81 69 01 
Measure of length РУ 
То pass by fraud 16 29 34 26 .01 
Ауегаре difficulty of all meanings 49 


7 
— a M ДА Average Percent 
Key Word (Crown) % % % of Difficulty 


ating Level 


Discrimin- 


А royal headdress 82 86 94 87 01 
Topmost part | 

of head 51 55 57 54 01 
British coin worth | 

five shillings 11 18 16* 12 01 
Part of tooth 25 27 24 25 39 
To invest with | 

regal power 19 24 26 23 01 
To hit on head 42 36 52 43 lle 
Crest of a bird 29 21 46 34 | 03жж 
То curve upward 13 15 22 17 ` 11** 

Average difficulty of all meanings accepted 45 
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ВЕКУЛСК 


they saw the sails in the distance. A () 
beam of light would shinein the water, and 
later they could make out the nets hanging 
from а ( ) beam on the stern. 


1. aradio 2. aheavy timber 3. show worry 
4. smileradiantly 5. candlelight 6. heavy 
cable 7. a ray of light 


The item analysis determined the paragraphs 
that were desirable from a standpoint of level of 
ease and difficulty and good discriminating power. 
Table V includes responses on the above para- 
graph, and were the type accepted for the final 
forms of the test. 

An illustration is given іп Table УІ of a para- 
graph that was rejected on the basis of poor dis- 
criminating power. 


As the flood rushed toward the old dam 
the men worked desperately to ( ) bar it 
from overflowing onto the highway. Sand- 
bags were used as a ( ) bar where the dam 
was weak. Completely out of sight was the 
( ) bar that usually reached out into Ше 
channel. 


1. a rushing sound 2, sand bank 3. pre- 
vent 4. danger signs 5. support 6. make 
bare 7. barrier 


Paragraphs like the above had to be discarded 
because the responses were influenced too strong- 
ly by the chance factor. 2 

The number of paragraphs in the preliminary 
context test was ninety-six. Sixty-one para- 
graphs fulfilled the requirements set up for items 
to be used in the final context tests. 


Test Developments 


j was analyzed, the final forms 
api парити all the words that fulfilled the 
1 ostulated for good test 1 қ 

n نا ت‎ гё алела balanced items in each € 
of the three tests, the words were pronn лав 
cording to average levels of difficulty an ас - 
The items were arranged in blocks of five, a 
an average was taken after every fifth per 
equalize each form. Thus, pupils who em 
not finish the test would always encounter 5111 wi 
items in any form of the tests at every fifth uc 

The words were placed as nearly as possi зай 
іп each form to measure the same pu 
ings. Occasionally this was impossible | > = 
if they balanced in difficulty, they di а -- 
ber of meanings at every fifth item. سے س‎ 
Occurred, they were palanced in the ne 
five items. 

Table ҮП shows in schematic form the plan 
for assigning words to each form. 


were m 
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Statistical Data on Population 


To secure data bearing upon the type of popula- 
tion reliability and validity of the constructed tests, 
the following tests were given to 491 pupils in 
grades 4 to 8 in the public schools at Dover and 
Nashua, between November 1 and December 19, 
1951. 


1. California Short-Form Test of Mental Matur- 
ity, Elementary S-Form, Grades 4-8 (1938). 

2. Gates Reading Survey for Grades 3 (2nd Half) 
to 10 (Vocabulary, Level of Comprehension, 
Speed and Accuracy). 

3. Multi-Meaning Vocabulary Tests (Reco gni- 
tion, Identification, and Context), Form A. 

4. Multi-Meaning Vocabulary Tests (Rec ogni- 
tion, Identification, and Context), Form B. 

5. Multi-Meaning Vocabulary Tests (R eco gni- 
tion, Identification, and Context), Form C. 


Analysis of Data 


The sensitivity of any test as a meas uring in- 
strument depends upon the ascending order of 
means between grades. Every test should serve 
to distinguish relative positions of groups as re- 
gards the trait which is measured. Table VIII il - 
lustrates the differences between the mean raw 
scores and standard deviations of each grade tak- 
ing the recognition test. 

The greatest apparent differences in mean 
scores were between grade 4 and grade 5, an av- 
erage difference of 21 points; then grade 5 and 
grade 6, an average difference of 10 points, and 
grade 6 and grade 7, an average difference of sev- 
en points. Grade 8's difference was three points 
lower on Form А, and practically equal on the 
other two forms to grade 7. 

Data showing the mean scores for grade 4 
through grade 8 obtained on the identification test 
are shown in Table IX. 

Whereas, the mean scores for each grade show 
an increase from one grade to the next, an average 
difficulty of 5 points between grade 4 and grade 5, 
of 21 points between grade 5 and grade 6, of 13 be- 
tween grade 6 and grade 7, the average difficulty 
between grade 7 and grade 8 drops to 4. 

Data showing the mean scores obtained on the 
context by pupils in grade 4 through grade 8 are 
represented in Table X. 

A striking fact lies in the failure of grade 8 to 
make little or no gains on any of the tests. 

The gains for the other grades were consistent 
and a steady increase was noted between mean 
scores of consecutive grade levels. Е 

The final forms of the tests were пої adm inis- 
tered, but re-scoring was done on each of the 
items accepted for the final tests from the try-out 
tests. The advantages of re-scoring after tests 


have been refined are: 
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TABLE УП 


EQUALITY OF ITEMS FOR THE THREE FORMS OF THE TEST 


Average Percent Discriminating Number of 
Word of Difficulty Level Meanings 
Form A 
1. Palm 48 . 01 4 
2. Beat 40 . 01 5 
3. Зпар 49 .01 5 
4. МШ 56 .01 4 
5. Fly 50 .01 5 
— EN 
Form B 
1. Bolt 47 „01 5 
2. Master 42 01 4 
3. Point 47 „01 5 
4. Arm 58 „01 4 
5. Net 49 .01 4 
ТТТ А A 
Form С 
1. Date 50 „01 4 
2. Guard 48 01 5 
3. Огдег 43 101 5 
4. Match 52 101 4 
5. Grant 50 101 4 


Form А - Average Percent of D: 
Level, .01; Number 
Form B - Average Percent of D 
Level, .01; Nu 
Form C - Average Perce 
Level, 


ifficulty, 49; Discriminating 

of Meanings, 23. 

ifficulty, 49; Discriminating 

mber of Meanings, 2 

nt of Difficulty, 49; Discriminating 

-01; Number of Meanings, 22. 

Note: Inspection of the table shows tha 
iculty and discriminating levels. 


t the items were balanced in dif- 
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TABLE XII 


CORRELATIONS BETWEEN SCORES ON THE CONSTRUCTED MULTI-MEANING 
SUB-TESTS AND READING AGE FOR GRADES 4-6-8 PUPILS 


“atl wUOUUlUllwU_"W‏ پپپ پپپ 


No. 
Factor Grade Cases Recognition Identification Context 
— ——— ——— —— —— ——A————— ————————— P ERREUR ERR ННВ 
R. A. 4 91 . 118 + .04 · 816 + . 03 . 830 + .03 
В.А. 6 99 . 873 + .02 · 887 + .02 „816 + .02 
В.А. 8 98 . 826 + .03 ‚881 + .02 5811 + „09 


TABLE ХШ 


CORRELATIONS BETWEEN SCORES ON THE CONSTRUC TED MULTI-MEANING 
SUB-TESTS AND CHRONOLOGICAL AGE OF GRADES 4 - 6 - 8 PUPILS 


ي و 


No. 
Factor Grade Cases Recognition Identification Context 
С.А 4 97 -. 257 + . 03 -. 255 + .05 -.277 + .02 
А 6 99 -.117 + .10 -.453 + .08 -.430 + .08 
С.А 8 98 a. 115 4410 -. 343 + .09 -. 441 + .08 


MEME O an ip анааан 
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TABLE XIV 
CORRELATIONS BETWEEN SCORES ON THE CONSTRUCTED MULTI-MEANING 
SUB - TESTS AND GATES VOCABULARY TEST FOR GRADES 4 - 6-8 
No. 
Test Grade Cases Recognition Identification Context 
Gates . . E 
Vocabulary 4 97 . 755 + .04 . 804 + .04 . 799 + .04 
Gates 4% 
Vocabulary 6 99 . 876 + .02 - 883 + .02 . 881 + .02 | 
Gates „* M | 
Vocabulary 8 98 . 826 + .03 -882 + .02 . 871 + .03 | 
| 1 
fh , 
: 3 
TABLE XV - 4 
CORRELATIONS BETWEEN SCORES ON Т 


SUB-TESTS AND THE GAT 


Grade 


ES COMPREH 


Recognition 


HE CONSTRUC 


Identification 


TED MULTI-MEANING 
ENSION TEST FOR GRADES 4 - 


6-8 


Context к= 


4 


97 


99 


98 


„716 + .05 
„159 + .04 


. 802 + .04 


+ 154 + .05 
801» . 04 


811 + .04 


4758 + .03 


. 791+ .04 


„871 + .03 | 

а че 4 
| 

[ 
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+ TABLE XVI 


CORRELATIONS BETWEEN THE THREE SUB- TES 
= TS OF = 
MEANING VOCABULARY TEST FOR GRADES 4 құман 


Grade ? Tu Tests I-II Tests 1-Ш Tests П-Ш 
« 4 : 2091 . 828 + .03 . 834 + .03 .821 + .03 
" ше © 99 .885 + .02 .897 + .02 .885 + .02 
виа 98 . 878 + .02 . 817 + .03 .861 + .03 
, 
Ч % 
~ « 
le TABLE XVII 


m d 
CORRELATIONS BETWEEN SCORES ON THE THREE FORMS OF THE 
MULTI-MEANING VOCABULARY TEST FOR GRADES 4 - 6 - 8 


. No. Forms Forms Forms 
Grade Cases A-B A-C Ba 0 
= | ——--<——-_—— 

4 o7 „850 + .03 . 812 + .04 .813 + .04 
6 99 . 831 + .03 .898 + .02 .822 + .03 
8 98 „812 + .03 .827 + .03 . 869 + .03 


ee 
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i from the same 
1. The results were obtained 
population without having them take the test 
a second time. | 
2. No complete form was given at the same 
time as the items had notbeen selected for 
the final tests when the preliminary tests 
were given. | 
3. All the tests were corrected а second time. 


Validity of the Tests 


In determining the validity of any test, it is 
necessary to find the correlations between the cri- 
teria used. 


California (p. 110) 

Reading Age 

Chronological Age 

Gates Reading Survey for Grade 2 (2nd half 
to 10) 

(Vocabulary, Level of Comprehension, 
Speed and Accuracy) 


Table XI gives the coefficient of correlation 
found between mental age secured {гот Ше above 
mental test andthe scores on the sub-tests of the 
Multi-Meaning Vocabulary Tests. 

It is recognized that the criterion of mental 
age represents as satisfactory an independent in- 
dex of the ability necessary for vocabulary as can 
be secured. 

The use of another factor was made to assure 
the validity of the multi-meaning tests. Reading 
ages secured from the results of theGates Read- 
ing Survey were obtained. Table XII sho ws the 
correlation found between this factor and the con- 
Structed tests. 

The correlation with reading age is high, since 
the task of identifying and knowing the definition 
of words is a component of reading. Table XIII 
presents coefficients of correlations between 
chronological age and the scores obtained on the 
sub-tests of the Multi-Meaning Vocabulary Tests. 

The correlations between chronological age 
and the multi-meaning vocabulary sub-tests in 
each instance were found to be negative. This 
finding denotes many of the oldest pupils in the 
sample obtained the lowest scores and many of the 
youngest pupils had the highest scores. 

To secure additional data bearing upon the va- 
lidity of the constructed tests, correlations be- 
tween subtests in the Gates Reading Survey Test 
were found. This test is divided into three sub- 
tests and a correlation between the vocabulary 
test and the multi-meaning vocabulary tests was 
made. 

Table XIV presents data on the correlations 
that have been found. 

All correlations between the Gates Vocabulary 
Test and the multi-meaning sub-tests are high, 
This finding indicates that the pupils who made 


high scores in the Gates Vocabulary Test made 
high scores on the constructed tests while pupils 
making low scores on the Gates Vocabulary Test 
made low scores on the multi-meaning Vocabu- 
lary Tests. 

With the total scores obtainedon the Gates Com- 
prehension Test, acorrelation was made with the 
Multi-Meaning Sub-tests. Table XV shows the 
findings. 

Gates Comprehension Testand the Multi-Mean- 
ing Vocabulary Sub- Tests are all statistically high. 
It is significantthatthese two Gates sub-tests cor- 
relate about equally well with the constructed 
multi-meaning vocabulary tests, and that there 
are consistently high correlations. 


Reliability of the Tests 


Basic statistical procedures were necessary 
to determine the reliability of the constructed 
tests. Coefficients of reliability for the sub-tests 
were found. Table XVI shows the relationships 
on these three sub-tests. 

It was to be expected that pupils who obtained 
а high score on one of the sub-tests would be like- 
ly to secure a high score on a similar test which 
measured a like skill differing only in application 
of technique. 

The large number of multi-meani ng words 
used in the preliminary tests made it possible 
after the test was refined to organize the material 
into three forms. Each form included a recogni- 
tion, identification, and context test. In order to 
ascertain whether or not the pupils obtained а cor- 
responding Score on each form of the test, coeffi- 
cients of reliability were found. Table XVII pre- 
Sents a comparison of the correlations obtained 
on Forms A, B, and C of the Multi-Meaning У0- 
cabulary Test. 

High correlations were found among all the 
forms of the Multi-Meaning Vocabulary Test. In 
view of the fact that eachform was equally weight- 


ed in difficulty, discriminating level, and number 
of meanings, i 


me it is not surprising that such high 07 
efficients of reliability were — " 


Conclusions 


Basic statistics for the evaluation of this test 
were presented. The following facts are apparent! 


1. More words have 
mately the same time a 
ing vocabulary test, 


2. The tests are highly valid, 


3. The different form. 
1 5 апа = sho 
very high reliability. e 


4. A more subst 
knowledge of words 
meaning is secured, 


been tested in approXi- 
nd space as a single mean 


antial inventory of pupils’ 
which have more than опе 
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THE DEVELOPMENT OF THE EARLY SCHOOL 
PERSONALITY QUESTIONNAIRE* 


RICHARD W. COAN 
University of Arizona 
RAYMOND B. CATTELL 
University of Illinois 


IN THE course of an extensive investigation of 
personality in middle childhood, a 200-item group 
questionnaire was constructed. It was adminis- 
tered to two samples of first- and second-grade 
children. One sample consisted of 151 children 
from the public schools of Mahomet and Rantoul, 
Illinois. The other consisted of 181 children in 
Decatur, Illinois. The item data from the two 
samples were subjected independently to factor 
analysis. Each study culminated in 18 obliquely 
rotated factors. This work was reported in two 
earlier articles (4,7). In one of these articles 
(7), the alignment of the two sets of factors was 
considered. When а combination of criteria was 
applied, it was found that there was reasonably 
clear matching for 11 factors from either study 
with a corresponding set of 11 factors from the 


other study. 
Construction of the ESPQ Scales 


It was decided that the eleven matching factor 
pairs represented the most stable, reproducible 
factors underlying the questionnaire data, and 
these were accordingly chosen for inclusion in a 
new factored questionnaire test. In addition, a 
large-variance factor appearing in the later, but 
not the earlier, study was accepted for represent- 
ation in this test. This decision seemed justified 
by the fact that the later study utilized a larger 
sample and yielded a closer app roximation to 
simple structure. In addition, it was thought de- 
Sirable to include a thirteenth factor of general 
intelligence. Because of the nature of the origin- 
al item battery, of course, such a factor had not 
appeared in either analysis. 

For each of the 12 personality factors, two 
six-item scales were constructed. Each scale 
met the usual criteria for factor-scale construc- 
tion: a) adequate representation of the given fac- 
tor in terms of item correlations with the given 
reference vector in both studies, b) suppression, 
Or balancing out, of irrelevant factor varl ance, 
C) an equality of a and b keyed alternatives. 


*The research reported in this article was supported by a grant from th 


of the State of Illinois. 


For intelligence, two eight-item scales were 
constructed. To insure adequate breadth, each 
Scale was assigned items of four different types: 
synonyms, reasoning, Subordination, and зирег- 
ordination. These were designed to blend in form 
with the personality items, and they utilize the 
same two-alternative response form. In difficulty 
level, they were aimed at the six-to-eight-year 
level. In the course of standardization, data will 
also be gathered on a battery of alternative items 
50 that any necessary item replacements can be 
made. 

Thirteen pairs of scales were thus constructed. 
These were segregated into two questionnaire 
forms (Forms A and B) for a test which has been 
designated the Early School Personality Question- 
naire (ЕВРО). Each form contains 12 six- item per- 
sonality scales and one eight-item intelligence 
Scale, makingatotal of 80items. The correspond- 
ence of the factor scales, considered in final nu- 
merical order, tothe factors in the two studies is 
shown in the following list (where ‘‘D’’ refers to the 
study of Decatur children апа “МН” to the earlier 
study basedon data from Mahomet and Rantoul): 


рі, MR 2 (-) 
D2, MR 4 
MR 3 
D4, MR1 


(>) 
о 


D6, MR 5 
D8, MR 6 
D7, MR 15 
. D9, MR 10 (-) 
10. D10, MR 11 (-) 
11. D13, MR 7 (-) 
12. D15, MR 14 
13. Intelligence 


ФФ сора сэл сорон 
(>) 
a 


Because of the brevity of the scales, it is as- 
sumed that neither form will be used in isolation, 
except in rare circumstances. The division into 
two forms, however, should facilitate two-session 
administration. Each form will require 30 or 40 


e Department of Public Welfare 
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i ftesting time. Administration of the 
sete eb ata Бізде session would ре advisable 
only with older subjects who can accept a fairly 
шт, м form and a key have been specially 
designed to permit easy and rapid scoring. For 
each form, there is a four-page answer form. 
Each page contains twenty rectangular boxes ar- 
ranged in two columns. In the middle of each box 
is an item number, as well as a picture which can 
serve the same purpose for younger subjects. 
There is an A at the left end of each box and a B 
at the right end. 

The standard instructions are an adaptation of 
those employed in the original researches. For 
each item, the examiner first directs the class to 
the appropriate box and then reads the item. Mark- 
ing instructions are given to identify each alter- 
native with the A or the B. The children are 
asked to indicate their responses by drawing a line 
through either the A or the B in each box. 

To give the reader some idea of the content of 
each scale, we shall list the questions utilized for 
each factor scale. The keyed alternative corres- 
ponding to the high end of the scale is underlined, 


Factor 1 


Form A 


1. Would you rather play: a) school, or b) cow- 
boys and Indians? 


2. Which would you rather be: a) a doctor, or b) 


a teacher? 

11. Would you rather: a) fly an airplane, or b) be 
a teacher? 

21. Do you like to talk to teachers? a) yes, or b) 
no. 

31. Do you ever talk back to your mother? a) yes, 
or b) по. 

32. Would you rather: a) go to a party, or b) stay 
home and play? 

Form B 


1. If you were in a play, would you rather be: a) 
a teacher, or b) a hunter? 


2. If another child has your coat, do you: a) take 

it away from him, or b) tell the teacher? 

11. a) Do you like to See other children cry, or b) 
does it make you sad? 

21. Would you rather: a) be in a pla 
something out of wood? 

31. Do you ever feel like running away from home? 
a) yes, or b) по. 

32. Would yourather have: a) a new baby come to 
live with you, or b) a little dog come to live 
with you? 


y, or b) make 


Factor 2 


Form A 


3. По you shiver when you hear a squeaky door 
or chalk scraping on the blackboard? а)Уез, 
or b) no. 

12. Do you like: a) all the things your mother 
gives you to eat, or b) only some of them? 

13. When your mother tells you you can’t do some- 

thing, do you want todoit even more? a) Yes, 

or b) no. 

Does it bother you if people say you don’t lis- 

ten? a) Yes, or b) no. 

23. When you аге told to do something or put some- 
thing away: a)doyou always do it right away, 
or b) do you sometimes forget what you are 
supposed to do? 


33. a) Do you have a lot of fun, or b)do things 
Sometimes go wrong? 


22. 


Form B 


3. Do grown-ups ever Say you daydream too 


much? a) Yes, or b) no. 

Would you rather: a) play by yourself, or b) 

play with other boys and girls? 

Do you ever feel like Crying when you see 

Something sad in the movies?'a) Yes, or b) 

no. 

22. Do people ever call you naughty and mischie- 
vous? a) Yes, or b) no. 

23. a) Are other children always nice to you, or 
b) do they sometimes pick on you? 


33. a) Are you happy all the time, or b) do you 
Sometimes get sad? 
nites get sad 


12. 


13. 


Factor 3 
Form A 


4. a) Are your dreams usually nice, or b) do they 
scare you? ее 


* Would you rather; 3) £0 on along trip in the 
: саг, or b) go to school? 
4. When your friends want to play a different 
game, do you: a) play their game, or b) go 
on with your game? 


- When you lose а book, 4 5 b) 
ЭНЭ › do you: а) cry, or 
34. Which do you like better: a) funny books, ог 

- b) the books you have in school? 
Я bes you zalier tell your mother and father: 
‚ about school, or b) abo: layed 
with your friends? шаасан 


Еогш В 
4. Do you like to 


2 : Play: a) hard games that you 
ТЕШ win, or b) easy games where nobody wins? 


| 


-— 
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9. Would you rather: a) work at home, or b) go 

to school? кеті 

14. Do people ever say you talk too much or call 
you a chatterbox? a) Yes, ог b) no. 

24. Do you think: a) school is hard, or b) school 
is easy? 

34. Would you rather: a) watch a game, or b) 
learn something in school? 

35. a) Do you think grown-ups shouldlisten to you 
and help you more than they do, or b) do you 
like it better when they just leave you alone? 


Factor 4 
Form A 


6. Would you rather have: a) afriend who can 
read well, or b) afriend who is good at ball 
games 2 

15. Would you rather: a) climb a tree, or b) look 
at a book? 

16. Do you think: a) everybody likes you, orb) 
only some people? 

25. Would you rather look at: a) comic strips that 
are funny, or b) comic strips with a lot of 
fighting and shooting in them? 

26. Do people ever say you’re stuck-up? a) Yes, 
or b) no. 

36. a) Can you touchabig bug, orb) are you afraid 
to touch bugs? 


Form B 


6. Would you rather: a) color abook, or b) climb 


a tree? я 
15. Would you rather: a) hunt for birds, or b) 


draw pictures of birds? 


16. Would you rather: a) watch people dancing, ог 


b) hear a story about airplanes? - 

25. Would you rather: a) talk to a friend, or b) 
look at funny books? 

26. Which do you like better: a) home, or b) 


school? 
36. Do you like to climb trees? a) Yes, or b) no. 


Factor 5 


Form A 


7. Would you rather: a) talk to your friends, or 
b) talk to your teachers? 
8. Would you rather: ај stay а 
shopping with your mother? 
17. а Do un: like T talk to your teacher, or b) 
are you sometimes a little afraid to? Рети 
27. When your friends start fighting: 2, m 
just leave them alone, or b) do you tell them 
to stop it? 
37. When you get angry: a) do Y 


just cry? қ 
38. Are you as good-looking as the other children 


at home, or b) go 


ou shout, or b) 


іп your class? a) Yes, or b) no. 
Form B 


7. Would you rather: a) play a noisy game where 
you pretend to be wild animals, or b) listen 
to a story read by your teacher? 

8. When somebody bawis you out: а) do you cry, 
or b) do you get mad? 

17. When your mother and father arebusy, do you 
like to help them? а) Yes, or b) no. 

27. If you had to make yourbed: a) would you lis- 
ten to the radio first and then make it, orb) 
would you make it right away? 

37. a) Do youlike to talk to your father, or b) 
would you rather not talk to him? 

38. Y you like to cross busy streets? a) Yes, or 

no. Tm 


Factor 6 
Form A 


9. a) Do you like for other children to play with 
your toys, or b) does it sometimes bother 


you? 

18. Do you сгу: a) morethanother boys and girls, 
or b) less? 

19. Do people boss you around too much? a) Yes, 
or b) no. 


28. a) Are you always very careful how you move, 
or b) do you sometimes rush around when you 
play and knock things over? 

29. Would you rather be: а) а grown-up, or b) a 
baby? 

39. Do your friends sometimes say: a) bad things 
about you, or b) only true things? 


Form B 


9. If you get upset or sad: a) do you get happy 
again pretty soon, or b) do you stay sad for a 
long time? 

18. When you get hurt, do you: a) cry, or b) try 
to keep from crying? 2-5: 

19. Do people sometimes punish you when you 
haven’t done anything wrong? a) Yes, or b) 
по. 

28. Do you like to play: a) new games, or b) just 
games you already know? 

29. When you get hurt: a) do you just try to forget 
about it, or b) do you sometimes cry? 

39. Is it sometimes hard to get people to under- 
stand what you're saying? a) Yes, or b) no. 


Factor 7 


Form A 


41. Would you rather: a) listen to a story, or b) 
watch two dogs fight? 
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42. Who usually has better ideas: a) you, or b) 
ur friends? 

51. Же you pretty good at: a) everything, ог b) 
just a few things? 

61. a) Are you getting along pretty well, or b) do 
you have a lot of problems and troubles? 

71. Do you like to tell stories to other people? 
a) Yes, or b) no. 

72. Do you usually: a) do what other people say, 
or b) do what you want to do? 


Form B 


41. Which do you like better: a) hearing stories 
about what a boy or girl does, or b) really 
doing things yourself? 

42. a) Does your mother let youdo almost anythin 
you want, or b) are there lots of things she 

won’t let you do? 

a) Are you stronger thanother children, or b) 

are they stronger than you? 

When you argue with people: a) do you some- 

times find out that you were wrong, or b) are 

you nearly always right? NN 

When you want to say som ething: a) do you 

just say it, or b) do you think it over first? 

a) Do you have to do things you don't want to 

do, or b) do you always do what you want to? 


51. 


61. 


71. 


12. 


Еасїог 8 
Form А 


43. По you like: a) to play with dogs, or b) to 

stay away from them? 

52. Would you rather: a) go to a party, or b) lis- 
ten to the radio and TV? = 

53. Do you get into: a) more trouble than most 
other children, ог Б) less trouble? 

62. а) Do you like todo things the way your mother 
says, or b) is your own way better? 

63. When children play tricks on you: a) do you 
cry, or b) do you get mad? 

73. Which do youlike better: a) cats, or b) dogs? 


Form B 


43. Which would you rather be: a) а dog, or b) a 
cat? 


52. a) Are you always neat and tidy, or b) are you 


sometimes careless and messy? 

53. Which wo would you like better: а) a story about 
fighting Indians, or b) astory about how In- 
dians made clothing? 

62. Would you rather: a) run, or b) sit still? 

63. Would you rather p s for: a) your 

rb ur ! 

73 mother, ave days When everything goes 

қ wrong. of b) are you happy all the time? 


Factor 9 


Form A 


44. Does anyone evercallyouacry-baby? a) Yes, 
or b) no. T 

45. a) Are you always pretty lucky, or o more 
bad rel happen to you than to other children? 

54. Is your teacher: a) nicer to other children, 

orb) just ав пісе to you? 

64. On the playground: a) do you makealot of 
noise, or b) are you mostly quiet? 

74, Would yourather: a) build things with your 
friends, or b) build things by yourself? 

75. Do you think people ever Say bad things about 
you behind your back? a) Yes, or b) no. 


Form B 


44. Do you know any children who are so dumb 
that it’s no fun to play with them? a) Yes, or 
b) no. 


45. Do you like to: a) run, or b) just walk? 


54. On theplayground: a) do you play by yourself, 
or b) mostly with other children? 

64. a) Do you have a lot of friends, or b) just а 
few friends? 

74. On the playground: a) do you run most of the 
time, or b) stand still a lot? 

75. Do you like to: а) help your mother and father 
with things, or b) just play? 


Factor 10 
Form A 


46. Would you rather go опа trip with: a) your 
father, or b) your mother? 

55. If you wake up in the dark: a) do you some- 
times feel scared, or b) do youlike it because 
it’s so dark and quiet? 

56. юэ pud rather: a) play a noisy game, OF 

ook at a book 

65. When po. by yourself? 


ple are talking about a movie that you 

have Seen: a) do you want to tell it your way, 
Or b) just listen to them? 

66. When your mot 


her is angry: a) do you feel 
like crying, or b) зорин 


do you feel ва; anyway? 
76. If you were up on à big rock: S bon you be 
Scared, or b) would you just laugh? 
2-8 you just laugh 
Form B 


46. Would you rather play with: a) older children; 
or b) younger children? ва 
55. Which would you like better: a) to hear вісгіез 
ош bears, or b) to have bears here right 


= 0943 ве 


56. Would you rather talk to: a) your father, or b) 
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your mother? 

65. When you go to bed: a) do you lie awake for a 
long time, or b) do you go to sleep right away? 

66. When youstart to Say something: a) do grown- 
ups.always listen to you, or b) do they do all 
the talking? 

76. Do you sometimes feel a little Scared when 
you're up on a high place? a) Yes, or b) no. 


Factor 11 
Form A 


4". When you get a new toy: a) do you let other 
children play with it, or b) do you watch out 
that no one breaks it? 

48. Does your mother ever go away for a little 
while and leave you at home all by yourself? 
a) Yes, or b) no. 

57. Do you like: a) a friend who talks a lot, or b) 
one who is quiet? 

67. Do you ever take your toys to bed with you? 
a) Yes, or b) no. 


77. Would you rather play: a) in your own yard, 
or b) in somebody else’s? 
78. Are you ticklish? a) Yes, or b) no. 


Form B 


47, Can you do things: a) better than most boys 
and girls, or b) not as well as most boys and 
girls? 

48. a) Do people sometimes bawl you out when you 
haven’t done anything wrong, or b) do they 
always treat you the way they should? 

57. a) Does someone wake you up in the morning, 
ог b) do you wake up by yourself? 

67. Do you ever talk inyour sleep and wake your- 


fup? a) Yes, or b) no. 
77. 4 сен A you had more Iriends to play 
with, or b) do you have enough frien s? 
78. When your mother and father say it's time for 
bed: a) do you like to go to bed, or b) do you 


want to stay up longer? 


Factor 12 


Form A 


49. Do you like: a) to meet new boys and girls, 
or b) to be with children you already know? 

58. Do youlike: a)to tell other children wiat 3 
do, or b) do what other children want to ^ : 

59. Do you ever do things that you should not do 


a) Yes, or b) no. with a lot of 


68. Do you like to play games: а) vith 2 Pa 
children, or b) just one or two children that 
you know? р 

69. a) Can you remember stories, 
forget them very soon? 

79. Would you rather: a) make som 


or b) do you 


ething with 


blocks or bricks, or b) play with other chil 
dren? 


Form B 


49. a) Do you make your bed inthe morning, or b) 
does your mother make your bed? 

58. Man get m а) doyou sometimes yell 

mp yo j 

i mee s eet, or b) do you just try to 

59. Would you rather: a) look ata picture book 
by yourself, or b)look at it with another boy 
or girl? 

68. When children fight, do you think grown-ups 
ce punish the wrong one? a) Yes, or 


69. Do you: a) usually put your clothes away at 
night, or b) just leave them anywhere? 
79. If people don’t want to do the same things as 


you: a) does it make you mad, or b) do yo 
just do what they want to do? А | z 
=== бу wantto:do 


Interpretation of Factors 


In our earlier reports, we offered some gener- 
al interpretations of questionnaire factors at the 
middle-childhood level. We have refrained, how- 
ever, from any extensive interpretation on the ba- 
sis of item content alone. But we have now as- 
sembled personality scales for twelve factors on 
a purely statistical basis. We can offer sound 
mathematical arguments for the utility of these 
scales, for they have been designed to cover a 
broad area of personality with maximal economy. 
The actual utility of these scales, on the other 
hand, now depends on our unde rstanding of their 
psychological referents. 

Apart from manifest item content, there are 
two chief avenues by which we can hope to gain 
some insight into the factors that we are measur- 
ing: a) we can determine the way in which ques- 
tionnaire scores co vary with other kinds of per- 
sonality measures, and b) we can link this ques- 
tionnaire instrument with those developed for other 
age levels by using a sample at an intermediate 
age level. The first type of comparison can be 
conveniently made, since we also have teacher- 
rating, parent-rating, and objective-test data for 
our Decatur sample. The analyses of these data 
have been reported elsewhere (1,3,5). In another 
article (2), we shall consider interrelations among 
factor scores derived from all of our various 
media of measurement and describe a common 
factor analysis of these scores. 

For the other type of comparison, it is conven- 
ient to relate the ESPQ tothe High School Person- 
ality Questionnaire (HSPQ). Thelatter test, which 
is described elsewhere (6), was devised for the 
12-to-16-year range. It contains 14 scales which 
have been identified in terms of factors that have 
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d in studies of adults and pre-adolescents. 
x ees was to give the ESPQ and the HSPQ to a 
sample at about the nine- and ten-year level. It 
must be recognized, of course, that some of the 
ESPQ will seem a bit ‘‘childish”’ to Such a group, 
while portions of the HSPQ will be incomprehen- 
sible to them. Some resultant distortion of cor- 
relations between the two tests is to be expected. 
No other procedure, however, permits such a di- 
rect comparison of questionnaire factors at the 
two age levels from which the two tests werede- 
rived. 

We can report findings for two independent 
samples.* The first sample consisted of two 
fourth-grade and two fifth-gradeclasses in Terre 
Haute, Indiana. There were 46 boys and 59 girls, 
making a total of 105. In age, they ranged from 
nine years and five months to eleven years and 
eleven months, with a mean of ten years and five 
months. These children received ail items of the 
ESPQ, exceptfor factor 13, at one session and 
Form A of the HSPQ at another Session. The 
product-moment correlations between the ESPQ 
total scores and Form A scores for the HSPQ are 
shown in Table I, 

А second sample was secured in Kent, Ohio. It 
consisted of one third-grade class and three fourth- 
grade classes. There were 43 boys and 49 girls, 
or a total of 92 children, rangingin age from eight 
years and five months to ten years and eleven 
months, with a mean of nine years and nine months. 
Forthis sample, we were able to Obtain total 
Scores, based on both forms, for all scal es of 
each test. The lower age level of the Kent sam- 
ple is conducive to some loss of reliability and 
validity for the HSPQ. Any suchlos S, however, 
was probably offset by the increased Stability af- 
forded by the use of both forms of this test. In 

fact, we might justifiably attach more weight to 
the Kent data than to the Terre Haute data. The 
product-moment correlations between ESPQ and 
HSPQ scores for the Kent sample are shown in 
Table II. 

In each of our tables, there are several rows 
and columns in which more than one substantial 
correlation can be seen. This suggests the defi - 
nite possibility of age changes in personality struc- 
ture. At either age level (six-to-seven or twelve- 
to-sixteen), there could be factors intermediate 
betweenfactors appearingattheotherlevel. There 
could be processes of factor convergence, where- 
by eleven-year factors are discernible at the six- 
year level only in the second-order realm, or of 
factor divergence, whereby six-year factors be- 
come second-order at the eleven-yearlevel. Our 
present data, however, do not provide a sufficient 
basis for drawing any suchconclusions. The pic- 


ture is complicated by the approximate character 
of factor scores estimatedby the present methods 
and by the fact that, at every age level thus far 
studied, there are some factors that inevitably 
tend to “cooperate”, or share high-loading vari- 
ables. In the construction of item scales for со- 
Operative factors, a complete suppression of іг- 
relevant factor variance is often not possible. For 
this and other reasons, the intercorrelations with- 
in one set of factor scales will not faithfully rep- 
resent the actual intercorrelations among the fac- 
tors. Some less severe distortion of correlations 
with outside measures is to be expected. While 
possibilities for developmental shifts in factor 
structure thus exist, they must be examined inthe 
light of other evidence than that presented here. 

In any case, some of the ESPQ factors can be 
identified with some certainty if both the item con- 
tent and the HSPQ correlations are carefull y 
weighed together. Any given correlation, of 
course, must be evaluated in the light of all other 
correlations obtained for its component variables. 

Factor 1 of the ESPQ shows some affinity with 
factors A, E, andI of the HSPQ. Since Е апа 
seem to be represented elsewhere, factor A looms 
as the best match. Such an interpretation appears 
to be consistent with the content of the items. The 
items, however, stress primarily a form of so- 
cialization incidental to adjustmentin a school en- 
vironment. This Suggests something akin to fac- 
tor K, or Socialized Morale, which has appeared 
only in rating data in previous studies of older 
groups. Inits present form, then, factor 1 may 
be identified ав A- (Schizothymia vs. Cyclothymia) 
or as К- (Dislike of Education vs. Socialized Mo- 
гаје). 

In factor 2, we see 
Subjective distres 
Depression) 
Same time, t 
Suggeststhe 
ness of fact 
ашу). Thi 


consistent with 
G (Superego Strength). 


to combine emotional sensi- 
and aesthetic interest in a way 
factor I (Premsia or Emotional 


Sensitivity), but the Opposing content is strongly 


ocurement of these samples, the authors are indebted to Dr. Rutherford В. d Dr. 
po. Т рудің and to the cooperating school personnel of Ter H ЕЕ ВРЕМЕНА 
па В. 


re Haute, Indiana, and Kent, Ohio. 


JM 
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suggestive of factor E (Ascendance vs. Submis- 
siveness). The correlations support interpreta- 
tions in terms of factor A, factor E, and factor I. 
Factors A and I are probably both better repre- 
sented by other factors in the present set. Fur- 
ther evidence may clearly identify the present fac- 
tor as E- (Submissiveness vs. Ascendance). 

The most prominent feature of factor 5 seems 
to be a certain dependence on parents and teach- 
er, with a conformity to their wishes. The cor- 
relations are generally consistent with this con- 
tent, but they do not point to any single clear 
match with a recognized factor. The two matrices 
agree in relating this factor to A, D-, E-, G, and 
H, but each of these seems to be better represent- 
ed by another factor in the set. 

Factor 6 shows emotional control and tolerance 
of frustration at the high end and a more ready 
loss of composure at the low end. The correla- 
tions most strongly suggest identification of this 
factor as D- (Phlegmatic Frustration Tolerance 
vs. Infantile Emotionality or Excitability), but the 
content is also quite consistent with O- (Confi- 
dence vs. Anxious Depression). 

At the high pole of factor 7, we see an aggres- 
sive self-assertion which is cons istent with the 
pattern of factor E (Ascendance vs. Submissive- 
ness) at lower age levels. Another possible in- 
terpretation is F (Surgency vs. Desurgenc у), 
which, unlike Е, is apparently not represented 
elsewhere in the present set of factors. Factor 8 
has no unambiguous match, though the correla- 
tions indicate some relationship to A, E, G, 1, J, 
and Q3. The pre vailing theme running through 
the items of this factor might be characterized as 
**motor expressiveness VS. socialized restraint." 

In the present correlation data, factor 9 aligns 
best with C. The item content accords generally 
with sucha match, though it is a little more 
strongly suggestive of AorH. The high scorer 
appears to combine certain schizothyme qualities 
of quietness, 5 eclusiveness, and sullenness. A 
and H are supported by the correlations, though 
less strongly than C. Since A appears more clear- 
ly elsewhere, factor 9 may be best identified as 
H- (Withdrawn Schizothymia VS. Adventurous Cy- 
clothymia) or C- (Poor integration vs. Ego 
Strength). est with I, and the соп- 


Factor 10 correlates b 11, сай 
tent is clearly consistent with this identification. 


At its positive pole, there is а certain holsters 
ness and toughness. At the negative pole, we fin 
fearfulness and timidity- We may tentatively 
identify this factor as I- (Rigid, Tough Poise ҮЗ. 
Sensitive Emotionality). : 

Factor 11 shows m marked relationship to any 
HSPQ factor, and no clear over all pattern : 
readily discernible in the items. It Seere ста 
to refrain from interpretation until further a : 
are available. Factor 12 presents а pattern а 
friendly sociability and calmness at the high епс. 
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The aggressiveness at the lowendofthe scale ac- 
counts for a high negative association with E, but 
the overall pattern at ће high end is not that of 
submissiveness. It is more nearly one of cyclo- ` 
thymia. Both A andH, however, appearto be rep- 
о Q4 (Low Nervous Tens ion 
- High Nervous Tensi i i i 
odi sion) is another possible in- 
Factor 13 was not used т Ше Terre Hautes 
In the Kent study, it correlates арргвстайу p. 
with factor B (Intelligence). This unambiguous 
finding actually exceeds expectations, for a corre- 
lation even as high as that obtained (+. 32) is sur - 
prising in view of the factthatthe ESPQ and HSPQ 
intelligence scales are severely restricted at op- 
posite ends of the rangein our sample. At the in- 
termediate age level of theKent sample, the ESPQ 
fails to discriminate at the high end of the range 
while the HSPQ fails to discriminate atthelow end. 


Summary 


Two equivalent forms were constructed for a 
factored personality questionnaire suitable for 
group administration to children in the six to eight 
year range. Each form contains 12 personality 
Scales derived from two earlier factor-analytic 
studies. A thirteenth scale for intelligence was 
added to each form. Scores on the questionnaire 
were correlated with those for a questionnaire de- 
signed for older children for two independent sam- 
ples. Inspection of the correlation data and of the 
items themselves permits satisfactory identifica- 
tion of most of the factors. There are indications 
that the test taps each of the following known fac- 


tors: 


Cyclothymia vs. Schizothymia 

General Intelligence 

Ego Strength vs. Poor Integration 

Infantile Emotionality 

Ascendance vs. Submissiveness 

Surgency vs. Desurgency 

Superego Strength vs. Dependent Im- 

maturity 

Adventurous Cyclothymia vs. With- 

drawn Schizothymia 

Т; Emotional Sensitivity vs. Toughness 

K. Socialized Morale vs. Dislike of Ed- 
ucation 

о. Anxious Depression 

04. Nervous Tension 


ш ознрорр 


In general, it seems inadvisable to accept the 
factor identifications which we have indicated as 
final. There are some departures from the fac- 
tors found among adults that accord with other 
findings on the composition of known factors in 
childhood. There are a few factors that do not 
align clearly with any known factors. Further re- 
finement of the scales should reveal whether these 
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represent real structural novelties atthe middle- 
childhood level or simply poor approximations to 
identifiable factors. A study dealing with the re- 
lationships among questionnaire, rating, and ob- 
jective-test factors (2) may provide further clar- 
ification. 
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Problem 


NON-VERBAL group tests have received wide 
attention in psychological testing programs al- 
though their use in this country has been rather 
limited. The most extensive use of the non-verb- 
al group test has been in the military services 
where the problem arises of processing and clas- 
sifying great numbers of illiterates and linguisti- 
cally-handicapped persons. Basically these non- 
verbal tests have been of the general classifica- 
tion and academic aptitude type and have been 
used primarily for screening and classificato ry 
purposes. Their development and use has 
Stemmed from the premise that personnel with 
linguistic and reading handicaps are penalized on 
a test score basis оп the attribute ostensibly be - 
ing measured. However, thedevelopment of these 
tests has been seriously handicapped by the diffi- 
culty of producing c ompletely non-verbal items. 
Except for the Army Beta Examination (8) and the 
Semantic Test of Intelligence (5) in which the di- 
rections are given completely in pantomime, 
mostly so-called non- verbal tests are merely 
partly non-verbal tests. Furthermore, they are 
difficult to administer, expensive to produce, and 
the individual items in these tests are generally 
too easy for personnel of superior ability. Prob- 
ably for these and other apparent reasons the use 
of non-verbal group tests has been quite limited. 
In any event, even their limited use has been fur- 
ther restricted to the area of classification test- 
ing and these tests have not found application in 
the field of achievement testing where the per- 
formance test generally replaces the paper-and- 
pencil test when the latter is considered inappro- 


priate. 


In the U. S. Navy, a system of competitive 
service-wide examinations for the selection and 
advancement of petty officers has been established. 
All candidates for petty officer rates in the 60-odd 
naval enlisted occupations who have met minimum 
requirements and who have been recommended by 
their superior officers for advancement to the next 
higher pay grade level compete on these examina- 
tions which are administered semi-annually 
throughout the naval service. These examinations 
consist of 150 multiple-choice items designed to 
sample the professional and military knowledges 
and skills required for successful job perform- 
ance in the rate being sought. They are further 
designed as highly discriminating selection tests 
since only those candidates are allowed to compete 
who have been selected by their commanding offi- 
cers and who have been certified by them princi- 
pally on the basis of training and on-the-job per- 
formance to be at least minimumly qualified for 
advancement. 

In order to provide for a universally fair and 
equitable system, all candidates for advancement 
are required to compete with their contemporar- 
ies on the identical standard. The principal com- 
ponent of this selection criterion is the standard- 
ized paper-and-pencil achievement type test de- 
scribed above. 

Among the 60-odd occupations which the U. S. 
Navy maintains in its petty officer structure at 
four pay grade levels, there is considerable vari- 
ation both between and within groups іп verbal and 
linguistic ability. In the more highly specialized 
and technical ratings such as Electronics Techni- 
cian, Air Controlman, and Aerographer, although 
there is reasonably high variance in this attribute 
within the groups, the overall ability is, on the 
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nerally high and does пої appear to 
quie ach a a problem in terms of its con- 
tribution to error variance on the advancement 
test score variable. However, in the less techni- 
cal service ratings suchas Steward’s Mate, Ships 
Serviceman, and Commissaryman, the relatively 
high variance coupled witha generally low overall 
verbal ability has led to the formulation of the hy- 
pothesis that completely verbal items in the ad- 
vancement tests tend to reduce the validity of the 
measurement of job proficiency for members of 
theselow verbal ability groups. The study re- 
ported in this paper was designed to test this hy- 
pothesis. 


Procedures and Results 


It was decided to conduct this res earch in the 
rating of Steward’s Mate at the level of third class 
petty officer candidates since the Steward’s Branch 
in the Navy represents the lowest level of verbal 
ability. In addition to the generally low verbal 
ability of personnel in this rating, there exists the 
additional feature of linguistic-handicap (bilingu- 
alism) caused by the employment of great numbers 
of native-born and native-educated Filipinos. 

For the purposes of this study verbal ability is 
defined and measured by performance on the Gen- 
eral Classification Test of the Navy Basic Test 
Battery. Factor analysis (9) andcorrelational (7) 
studies have shown this test to possess an excep- 
tionally high verbal factor loading. Scores оп this 
test are reported ав Navy Standard Scores (X = 50; 
0-10). From samples drawn from among сап- 
didates for advancement to petty officer, third 
class in 52 naval occupations in February 1955, 
Steward's Mates proved to represent the signifi- 
cantly lowest level on verbal ability (X = 35.70; 
o = 8.40). Thenext higher level was reached b 

Gunner’s Mates (X = 40.47; o = 7.77) (C.R. = 5.58). 

A team composed of a subject-matter special- 
ist (Chief Steward’s Mate employed at the Naval 
Examining Center as item writer for Steward’s 
Branch Examinations), a professional test tech- 
nician (civilian educational specialist normally 
assigned to the task of reviewing and editing items 
in Steward’s Branch Examinations), and а com- 
missioned officer of the Supply Corps who norm- 
ally reviews items for Steward’s Branch Examin- 
ations from thestandpoint of technical accuracy 
worked on the project of developing the exper- 
imental examination forms. 

In developing these test forms, items were 
either drawn from item card files or were con- 
structed as original items. The criterion stand- 
ards which items were required to meet in order 
to be considered acceptable for inclusion in this 
examination were: 

1) They must have sampl ed the petty officer 
requirements (1) of Steward’s Mates particularly 
at the third class level; 2) for items drawn from 
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the files, they must have been of appropriate dif- 
ficulty level (p> .10 <.90) and, also, of positive 
and significant discrimination (р> + 0. 4) (3); and 
3) the material upon which each item was to be 
based must have been of such a nature that it would 
lend itself equally well to presentation both in 
completely verbal form and in picture form (illus- 
tration item). 

Accordingly, a sufficient number of items were 
either selected or constructed to meet all three of 
the above criteria. Eachitem was developed in 
two forms: 1) completely verbal form—verbally 
presented stems and options (four choices), and 
2) illustration form—verbal stem with options 
presented in picture form. In both forms the use 
of language was kept to a minimum, andto the ex- 
tent possible the words employed were keptin the 
vernacular. 

Two test forms, each consisting of the same 
fifty items were developed. In form II, the form 
composed of the picture items, the order of pre- 
sentation was reverse from that of form I ( verb- 
alform). Inallother respects, the items were 
identical, except for the mode of presentation. 
The same explicit and careful directions and gen- 
eral instructions were presentedonthecover page 
of both forms, and both were reproduced by com- 
mercial offset printing methods. In the printing 
of the final forms, however, the illustrated form 
was reproduced about twice the size of the verbal 
form in an attempt to minimize any confusion 
which migh arise on the part of the subjects to be 
tested by the lack of detail in the pictures which 
Sometimes results from the reduction in size from 
original to final form by this method of printing. 

The experimental tests were administered in 
September 1955, to a total of 111 Stewardsmen 
representing nine different fleet units and stations 
who were assembledat the Naval AirStation, North 
Island, San Diego, California. Prior to adminis- 
tering the tests, the group of 111 subjects was 
divided into two subdivisions (М, = 67 and М» = 44). 
Sub-group one was administered the verbal test 
form first and the illustrated test form second. 
For sub-group two, the above procedure was ге- 
versed. This process was employed in order 10 
minimize the influence of **practice effect.’ Orig- 
inally, it was planned to administer the forms al- 
ternately to the entire group, but physical facili- 
ties and other factors made it necessary to divide 
the gro i i а М 

groups in this manner and in these numbers, 


and to administer the tests in two separate testing 
areas, 


Analysis of the Data with Interpretations 


Total Sample— 


2 — For the 111 subjects, means and 
standard deviations were calculated ог scores 0b- 


tained on each test form; and then employing the 
difference scores, the difference between these 
means was tested for significance. These results 
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SAMPLE QUESTION 


SAMPLE ANSWER FORM 


1. Which is NOT kept in the officers' galley? 


are presented in Table I. Only a very slight dif- 
ference was found between the mean scores ob- 
tained by the total sample on the verbal and illus- 
tration forms of this test. Statistically, this dif- 
ference borders on significance at the five percent 
level of confidence. However, it does appear that 
variation of the mode of presentation of items does 
not greatly influence the total test scores. 

American Group vs. Filipino Group—In order 
to discover whether or not the mode of presenta- 
tion of items might show different effects for the 
more seriously verbal-handicapped than for the 
lesser linguistically-handicapped, the total group 
was divided into American-born and Philippine- 
born subjects. 

The differences between mean scores on both 
test forms obtained by originally English speaking 
subjects and originally Filipino speaking subjects 
proved to be statistically significant. A further 
inspection of the data in Table II reveals that the 
mean scores and variances obtained by these two 
groups on the ve rbal and illustration forms are 
practically the same, and that they are very close 
to the total sample values. Here again, itis shown 
that mode of item presentation does not influence 
final test scores. 

Higher Verbal Group VS. Lower Verbal Group— 
A further comparison was made between the scores 
obtained on each form by subjects of higher vs. 
lower verbal ability as measured by the Navy Gen- 
eral Classification Test. Distribution of GCT 
scores of the total sample (N = 107)* yieldeda 
mean of 36. 67 and а standard deviation of и 
Qi and Qs values wore dieler Тре gan 
to be, respectively, 30. 8 and 40. 4. 1 ae 
low groups were selected as close to these n 
as possible (31 and 40). Due to skewn wy 
distribution and rounding-off procedures, the 


2 =27. 
sulting sub-group N's were Каз = 37 and Ко 


The data in Table Ш again demonstrate the lack 
of influence of mode of presentation within groups 
of both higher verbal ability and lower verbal abil- 
ity. Itis noted, however, as expected, that the 
test score means on the respective forms of the 
subjects within the higher verbalability group are 
significantly higher than those of the lower verbal 
ability group. Here, again, the hypothesis of no 
difference must be accepted for both groups. 

Study of “Practice Effect" —Since the subjects 
included in this sample were scheduled to take 
both forms of this experimental test inone sitting, 
the situation required that “practice effect" be 
controlled to the highest degree possible. Unfor- 
tunately, however, circumstances forced the ad- 
ministrators to altertheiroriginal plans of admin- 
istering the forms alternately to all subjects seat- 
ed in a single testing room. Restriction of facili- 
ties both within and outside the testing room ne- 
cessitated the splitting up of the total sample into 
two separate groups seated in two distinct testing 
areas. The sub-groups were randomly selected 
and again, due to physical restrictions it was im- 
possible to equate the groups insize. GroupI 
numbered 67 and was administered the verbal 
form first, then the illustration form. Group П 
was composed of 44 subjects, andforthem the ad- 
ministration procedure was reversed. The groups 
were kept intact and were separatedduringa short 
break period between administrations. Both were 
told that they would be taking two examinations on 
the same subject but in different forms. Neither 
group was toldthatthe items contained in one form 
were identical to those in the other form. No 
strict time limits were imposed on either group, 
but the ad ministration time for both groups was 
approximately equal. The GCT scores were stud- 
ied for subjects who took the verbal form first and 
those who took the illustration form first. 


— РЗ КСЛ т were not available. 


*There were four subjects for whi 
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j identical means (Mp = 0. 05) 
h Ба еш selecti on of subjects to 
Oron I and II prior to the administration of the 
Г indeed effective in producing two groups 
equated with respect to general verbal ability. | 
To test for the presence of any ''practice е - 
fect’ within groups, means and standard devia- 
tions were calculated on each test for those sub- 
jects who took the verbal form first and those who 
took the illustration form first, and the differences 
between means were tested for Significance, 
The data in Table V reveal the ex 


istence of 
some “ргасИсе effect." It will be noted that for 


A further Study of the « 
made b 


“Samples a] 


other with the illustration form (P> 
the verbal form IP, 01) 


Reliability and Validity of the Test Forms 


Reliability— The reliability of these form 
аба D» the Rulon (5) Method applied 5 ч 
allel forms and correlated Samples, The formy- 
la employed for determining this reliability сое{- 
ficient is identical to the formula for eriving the 
correlation coefficient between a Set of difference 
scores. 


012 + 022 _ са? 


Гүү = 
20,0; 


For these tests the reliability Coefficient Was 


found to be .55. Since the experimental forms 
were each composed of only 50 items whereas the 
standard examinations are comprised of 150 items, 
an estimate of the total test reliability was com- 
puted according to Guilliksen (2), and was found E 
be equal to . 79. This is evidence of reasonably 
high test reliability. 
For each test form a K-R # 20 reliability esti- 
mate was obtained. This formula yielded an ш- 
ternal consistency coefficient of .47 for the illus- 


tration form (50 items) and.52for the verbal form 
(50 items). 


and tested for significance proved to be homogen- 


ding further evidence of 
The lower aver- 


which the K-R #20 coefficient is 

attenuated. 
Validity — 
for establishi 


. In each case, the resulting 
correlation Coefficient Was .53. This value was 


езе item analysis 
Were then compared 


apparent rea] differences. 
represented 


al test of signifi- 
therefore could ын 
rm. А rough esti- 

e rences between parallel items 

ased upon this c tive judgment re- 

han twenty-five percent of items 

ination When presented in € 
hese item analysis data sh o wed tha 
- great majority Of the items in both forms 
Proved to be high Positive in discrimination power 
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and to possess high internal consistency. 

To test the statistical significance of the differ- 
ence between difficulty levels of the parallel items, 
a method described by McNemar (4:55-59) was 
employed. Тһе total sample of 111 cases was ran- 
domly reduced in size to 100 inorder to facilitate 
the computation of proportion values, and the pro- 
portions passing and failing each parallel item 
were analyzed. 

Of the 50 sets of item pairs (verbal vs. illus- 
tration presentation) a total of 35 yielded differ- 
ences in difficulty level which were not statisti- 
cally significant. Of the 15 pairs of items in which 
statistically significant differences occurred, nine 
proved to be less difficult in illustration form, 
and six were less difficult inverbal form. One of 
the nine item pairs which appeared to be less dif- 
ficult in illustration form (С. Е. = 2.12) proved to 
be almost as easy in verbal form (Р =. 99, РУ = 
.93). For these fifteen items, the following spe- 


cific findings are reported: 


1. Items in which fine discriminations and dis- 
tinctions are required to be made appear to in- 
crease in difficulty when presented in illustration 
form. For example, in both item sets in which 
the examinee is required to discriminate between 
a dessert spoon and a teaspoon, a significant in- 
crease in difficulty of the illustration form was 
“+” items which require a close attention to 
detail there also appears to be acorr e sponding 
increase of difficulty level in illustration form. 
For example, in item sets in which the examinee 
is required to pay close heed to details which are 
somewhat confounded in the rep roduction of the 
illustration, there followed a substantial increase 
in the difficulty level of the illustrated item. - 

3. In items in which either clearly eta Se 
action or identification of Mos а ел cds = 

to be а decrease 11 2 #6 
the Ginctrated item. This was dee gt pim E 
the fifteen item sets which produced sta 


significant differences. 


Summary and Conclusions 

To test the hypothesis that co mer b oso 
items in achievement type tests anne of low 
the validity of measurement spa pi test forms 
verbal ability groups, two exp бе и а sample of 
were developed and administered to ге native- 
111 Navy Stewardsmen, 44 of whom > native-born 
born Filipinos and 67 of whom were contained 
American Negroes. Both test forms 


j owledges and 
the identical items, each ракав ward Маје, 


ski i for promotion 10 қ У 
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Both test forms were administered to the same 
subjects, and comparisons of performance were 
made both on a total score basis and for individual 
pairs of parallel items. Both experimental tests 
proved to be parallel forms. Reliabilities and va- 
lidities of each form were also identical. Of the 
50 item sets (pairs) 35 provedto be of equal diffi- 
culty level; andof the 15 which showed statistical- 
ly significant differences, nine were less difficult 
in illustration form, and six were more difficult. 
Also no significant differences were found to exist 
between the performance on both test forms of 
American-born and Philippine-born subjects; and 
between subjects of higher and lower verbal abili- 
ties. 

From the findings obtained in this study, the 
following conclusions appear to be justified: 


1. Varying the mode of presentation (from com- 
pletely verbal to partially verbal form) of achieve- 
ment type test items does not appreciably affect 
the predictive validity of the measurement when 
applied to members of the low verbal ability group 
studied. 

2. For the major portion of the material upon 
which this type of measurement is based, presen- 
tation in either form produces reasonably the same 
results. 

3. The required increase in time, effort, and 
nonetary costs involved in the development and 
construction of large numbers of illustration items 


would not be justified. 
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The Problem 


IN THE FIELD of perceptual psychology, it is 
sometimes held that behavior is a function of per- 
ception (15). 

Authoritarian behavior has been the subject of 
much research in recent years, and certain psy- 
chologists hold thatthose persons classed as ‘‘au- 
thoritarians”’ are less able to perceive reality, 
tolerate ambiguity, resist suggestion, and assert 
their independence than are ‘‘nonauthoritarians. " 
Accordingly, if behavior is a function of percep- 
tion, then these behavior characteristics of au- 
thoritarians may be products of the way they per- 
ceive. One purpose of the present study was to 
explore this possibility. 

Further, there is general feeling that an indi- 
vidual's cultural background exerts considerable 
influence on his personality development. If this 
view is correct, interest attaches to the way in 
which the culture exerts its influence. Is it that 
a particular cultural situation tends to develop in 
the individual certain perceptions, and in this 
manner influences the very fabric of his person- 
ality? In the present study, groups of persons 
from two different cultural backgrounds were in- 
cluded in order to obtain some evidence on this 


particular problem. 
Maslow (13) first outli 


Character in 1943, listing, 1 1 
some of the behavioral characteristics which were 


examined in this particular study. Adorno (1) and 
his colleagues devoted conside rable time and 
space reporting their studies of authoritarian 
personalities, out of which emerged the F (Fascist) 
scale, a measure of anti- democratic potential. 
Christie and Jahoda (3) examined the entire re- 
search procedure involved in the de velopment of 
the F scale and seriously questioned some of the 
techniques and conclusions of Adorno and his col- 
leagues. Titus and Hollander (17) reviewed over 
Sixty studies which usedthe F scale in paye 
ical research from 1950 to 1955 and conclude 

that the concepts of *tpersonality"' and ‘‘syn- 
drome” were without meaning. They further соп- 
Cluded that the use of the Е scale ав а PF edictive 


ned the authoritarian 
among other things, 


instrument was still questionable. Masling (12) 
reviewed four studies of authoritarianism and con- 
cluded that the characterization of the authoritar- 
ian had been overdrawn. Drucker (5) studied col- 
lege students and student nurses and reported that 
conformity was related to authoritarianism 
Koontz (11) compared white and Negro students 
from two Southern universities and found no sig- 
nificant differences regarding authoritarianism 
between the two ethnic groups. Further, he found 
only nine percent of the total sample were “їгце 
authoritarians,’’ and also that intolerance of am- 
biguity was probably not highly relatedto authori- 
tarianism. Christie and Garcia (2) compared stu- 
dents in California with a Southwestern city and 
found that students from the Southwestern city had 
higher F scale scores. Davids (4) studied 20 col- 
lege students and found no significant relationship 
between F scale scores and tolerance of either 
ambiguous visual or auditory stimuli. Frenkel- 
Brunswik (6, 7) pointed out that prejudiced persons 
were less able to tolerate ambiguity. Maslow (14) 
discussed *'self-actualizing people" and reported 
that such persons had clear perceptions of reality, 
could tolerate ambiguous situations, and were 
readily able to assert their independence should 
the situation so arise. Snygg and Combs (15) de- 
scribed an “adequate personality” in similar 
terms. Studies of visual perception of reality re- 
ported by Koffka (10) and Kelley (9) indicated that 
reality derived its characteristics from the total 
situation, including the perceiver, rather than 
from the stimuli alone. Finally, Stevens (16) said 
that the role of the listener needed to be studied 
by “varying the stimulus and/or by varying the 
listener. '' 

Three major hypotheses were developed in or- 
der to aid in the explorationof the specific aspects 
of the problem. Each of these major hypotheses 
had several related sub-hypotheses. These are: 


1. Persons from a Southern rural cultural situa- 
tion will be more authoritarian than persons 
from a Northern urban cultural situation. That 
is, persons from aSouthern rural cultural sit- 
uation will: 
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е higher on the F scale \ 
x = able to perceive reality 
с) be less able to tolerate ambiguity 
d) be less able to assert their independence 
e) be less able to resist suggestion 
1) have fewer Perceptions 


2. Girls will be more authoritarian than boys. 
That is, girls will: 
a) score higher on the F scale А 
b) be less able to perceive reality 
с) be less able to tolerate ambiguity 
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will be a positive relationship bet 
lowing factors: 


a) F scale Scores and sy 


ввез Ину 
b) F scale Scores an 


d fewer Perceptions 


bama and urban Michigan. 
The sound test cons isted Of а Series or ques. 
tions and thirty-four non-verbal Sounds recorded 
on magnetic tape. The questions Were designed 
to procure specific information (age, Sex, grade 
in school, etc.,) from each of the Stude : 


cipating in the study, Twenty sounds required а 


and style of the test, The tape was played at a con- 
Stant speed and volume, 

Ound required approximately ten sec- 
ying time and was followed Бу exac tly 


scribed, i.e., the student di d no 
kind of description in th 


Considered as “not Percej 


еасћ of which was 
played two times, required esponses in a more 
Structured Situation, Thatis, each of these sounds 
had four а] te i ipti 
Student соша 


hese was sug- 
gested as being correct, 


Actually, 'none of the al- 
re correct descriptions of the partic- 
Я although the last choi 
ething else » 


ternatives were correct 
ething else,” 


Y Correct si > апа this was only 
Since it diq " ae 
tual Source of the 50 Not describe the a 


und, the degree of certainty a 
Subject haq about these а Mbiguous alternatives 
Ог ће р 


| Purposes or this stud У 
Гана represented the соса description 
ап individuals US. Suggestibility represented 
Suggested г SSponse which corresponded to а 

езропзе for а Particular aural stimu- 
Tànce of am iguity represented the de- 
tainty individual над about his ini- 
TCeption upon hearing certain sounds 


іпа; ft tion of independence 
‘Ndi viduals Selection of the re- 


then, a percep- 
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sponse “something else” when presented with a 
particular aural stimulus and three other alterna- 
tive responses, none of which were correct, but 
one of which was suggested as being correct. . Fin- 
ally, those sounds which were not described were 
considered as not perceived. 

The F (Fascist) scale (1), a commonly used 
device in psychological research, was selected as 
an instrument to determine an individual's tend- 
ency to be authoritarian. The instrument em- 
ployed in this study consisted of 27 items admin- 
istered as а “Public Opinion Questionnaire, ’’ the 
subjects indicating the extent of their agreement 
or disagreement with the various items. High F 
Scale scores indicated a tendency toward author- 
itarianism. 

The final group studied consisted of 22 girls 
and 42 boys from urban Michigan, and 48 girls 
and 43 boys from rural Alabama. The mean age 
of the Michigan sample was 16.31 years, and the 
mean age of the Alabama sample was 16.40 years. 
All subjects were Caucasian and had lived ten or 
more years in their particular locale. Only the 
results for those persons who passed a pure tone 
audiometer screening test were included in this 
report. 

The subjects from Michigan came from a large 
metropolitan area in the sout heastern section of 
the state. The school, located centrally in this 
automobile manufacturing city, had approximate- 
ly 2000 students in grades nine through twelve. 
The student body was a heterogenous group rep- 
resenting different races, nationalities, and re- 
ligious affiliations, as well as deviations in social 
and economic status. Two groups of ninth-g г ade 
and two groups of twelfth-grade students from this 
area were included in this study. 

The subjects from Alabama came from a small, 
rural community situated about 55 miles southwest 
of Chattanooga, Tennessee, on top of what iss 
commonly known as Sand Mountain. The рорша= 
tion was exclusively white and the four churc E 
in the immediate vicinity were all Protestant. 
The enrollment of the consolidated school from 
which this group came was approximately х 
dents, grades one through twelve. парне 
percent of the school population were bus- ~ 
ported. The student body was essentially е 
mogeneous group, all white, and шэн ийш 
sively Protestant in their religious 4^! AI? 
Most of the students lived on small pm. at 
to forty acres in size. Two groups of ww -8 wile 
and two groups of twelfth-grade students from 
area were included in this study. 

The statistical differences between the е 
from these two cultural situations and as 
Sexes was compared by computing і for кү tie: k 
ing scores: authoritarianism as depicted ^y oi 
Scale, perception of reality, tolerance © ber ог 
Euity, assertion of independence, and num i5 di 
Sounds not perceived. Further, coefficien 
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correlation (r) between pairs of scores for stu- 
dents from both cultural situations, from both 
groups combined, for all of the boys, andfor all 
of the girls combined were also computed. 


The Results 


Table I shows a comparison of mean scores of 
the Southern rural and Northern urban group per- 
formances on the F scale, perception of reality, 
tolerance of ambiguity, suggestibility, assertion 
of independence, and sounds not perceived, as 
ascertained in this study. 

In general, persons from the Southern rural 
group scored markedly higher on the F scale 
(mean of 5.20) than did those persons from the 
Northern urban area (mean of 4.02). Further, 
those persons from the Southern rural group also 
had fewer perceptions of reality (mean of 5.74) 
than persons from the Northern urban group (mean 
of 8.47), were more suggestible (Southern rural 
group mean of 7.09, Northern urban group mean 
of 6.59), were less able to assert their independ- 
ence (Southern rural group mean of .34, Northern 
urban group mean of 1.47), and had fewer percep- 
tions, i.e., more sounds not perceived (Southern 
rural group mean of 4.44, Northern urban group 
mean of 2.25). Those persons from the Southern 
rural group, however, were more able to tolerate 
ambiguity (mean of 27.52) than were members of 
the Northern urban group (mean of 25.72), as 
measured in this study. All of these differences 
were statistically significant at the five percent 
level or higher except that one between mean scores 
on the suggestibility index. 

Table П shows a comparison of the mean scores 
of boys from these two areas combined and of the 
girls on these same factors. These data indicate 
that there is a tendency among the girls tested in 
this particular study toward a pattern of authori- 
tarian behavior characteristics, although none of 
the mean scores are significantly greater than 
those of the boys. Girls did score higher on the 
tolerance of ambiguity measure, but this differ- 
ence was not statistically significant either. 

Table III shows the coefficients of correlation 
between various pairs of the following scores: F 
scale, tolerance of ambiguity, perception of real- 
ity, assertion of independence, suggestibility, and 
number of sounds not perceived, as measured in 
this study. The correlation coefficients (r) were 
calculated for the Southern rural sample, the 
Northern urban sample, for both groups combined, 
for all of the boys, and for all of the girls. In this 
table those statistics which are underlined once 
( ) represent relationships in the predicted di- 
rection, whereas those which are underlined twice 
(С) represent relationships which are in the di- 
rection predicted and which are statistically sig- 
nificant at the five percent level of confidence. 

There are several observations about Table Ш. 
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COMPARISON OF MEAN SCORES OF SOUTHERN RURAL AND NORTHERN URBAN GROUPS 
ON SIX BEHAVIORAL CHARACTERI 


STICS 
I NP 
Group F R Am 5 

Northern Urban 4.02 8.47 25.72 6. 59 1.47 2. 25 
Southern Rural 5.20 5,74 27. 52 7.09 34 4,44 

t 10. 63» 4. 80* 2, 09* 1.24 6. 64* 4,81% 
F = F scale Scores Г = Assertion of independence 
К = Perception of reality NP = Sounds not perceived 
Am = Tolerance of Ambiguity в аа 
SM s 


Differences Significant at 5% level 
Suggestibility 


F =F scale Scores = С 
В = Perception of reality I = 68estibility 
Am = Tolerance of ambiguity 
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i ost important, is the 
as ace ee ёо үлде z Elation coefficients are 
و‎ even though several are stati stically 
, 
шо. їсї f correlation are 
e coefficients of с 

E to be some consistency p 

attern of relationships which may be wo rthy o 
“Ээ, em For instance, in the Southern ru- 
ral group elevenof the fifteen relationships are in 
the direction predicted, and six of these are sta- 
tistically significant. Ontheother hand, the North- 
ern urban group has only three relationships which 
are statistically significant, andone of these is in 


the direction opposite from that which was hypoth- 
esized. 


If both groups are com 
coefficients calculated for 


these Various factors for all of the 
girls it seems that there 1 


1S some very definite 
pattern ррагеп! since al] fifteen Coefficients are 
in the direction Predicted, a 


and sounds not р 


who selected the Suggested resp, 
lect “something else, » 


S are probably 
est itself, 


Discussion of the Results 
— 2L е Results 


Persons from the Southern гига] Cultural sity. 
ation scored higher on the F scale 

correct perceptions of reality, Were less able to 
resist Suggestion, andalso had fe 


ЭР to 
It seems that these data, then, are а 
accept, at least tentatively, the firs Sa A 
namely, that persons from a хоно өө emper 
ituation will be more au а у 
den и a Northern urban cultural қақ 24 о 
And although the girls scored higher 2 reality, 
thoritarian index, had fewer perceptions о өні 
were less able to resist 5 ug gestion, 2 «Көзге 
able to assert their independence, and un. 
perceptions, none of these scores differe и 
icantly from those of the boys. These cap erg 
insufficient to accept the second hypothesis, linn 
fore, that girls will be more тыа ат ша 
boys, even though there does seem to be за ср in 
idence of a pattern of characteristics tending 
uch a direction. 
3 Of the 75 coefficients of correlation шеше 
to portray the vari Ous relationships хэрэх эйч 
in this particular Study, 54 were in the эмэн 
predicted. Of this number, however, only 23 
were Significant Statistically, апдо these 26, nin 
were of a questionable nature (i. e., those between 
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ically that tenable experimentally, at least as 
measured in this study. The characterization of 
the authoritarian personality as it has been dis- 
cussed in other research seems to exist, but ina 
very indefinite and incomplete sense. It would 
seem that perhaps this theoretical characteriza- 
tion has been overdrawn, especially as regards 
intolerance of ambiguity. 


Conclusions 


Persons from different cultural situations 
seem to perceive the same sounds differently. * 
Those persons from the Northern urban area who 
participated in the present study apparently had 
more perceptions and were more accurate in their 
perceptions. 

Further, it seems that authoritarians are less 
“ореп” in the way they perceive aural stimuli. In 
other words, since they are less accurate, appar- 
ently have fewer perceptions, andare less able to 
assert their independence, among other things, 
those persons who are authoritarians do not seem 
to be as effective in the perceptive process as are 


nonauthoritarians. 
Conversely, the nonauthoritarians seem to have 


more perceptions, to be more accurate in perceiv- 
ing aural stimuli, and assert their ind ependence 
more effectively than authoritarians. Because of 
these specific differences in perceptual abilities, 
it appears that nonauthoritarians are more “ореп” 
to aural stimuli than authoritarians. 


Suggested Research 


rse of the present study several 
emerged. Some which the author 
e described: 


During the cou 
related problems ? 
feels deserve investigation аг 


ns from Northern rural and South- 


1. How do perso г 
tions perceive aural 


ern urban cultural situa 
stimuli? 

2. What is the rela ^ 

i horitarianism ? 

3. pend ihi talan teacher” and the “‘au- 
thoritarian personality” the same? 

4. Is there a difference between “direct”? author- 
itarianism and «¢manipulation’”’ 2 


tionship of tolerance of ambi- 
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GROUP STRUCTURE, ANXIETY, AND PROB- 
LEM-SOLVING EFFICIENCY 


FRANK W. BANGHART 
University of Virginia 


THE PURPOSE of this study was to compare 
the relative differences regarding the influence of 
anxiety on problem-solving, between cooperative 


and non-cooperative groups. 


Background 


There is evidence in the literature to suggest 
that anxiety level does influence learning and per- 
formance. Castaneda and others (4) infer that 
“the tendency for the high anxious subjects to per- 
form more poorly in co m parison to low anxious 
subjects increases as the difficulty of the task in- 
creases.’’ The difference found by Castaneda et 
al., was statistically significant at the . 025 level. 
The difference was based upon the high anxious 
child doing more poorly on difficulty tasks than 
did the low anxious child. However, the high anx- 
ious child apparently did no better than the low 
anxious child on less difficult tasks. Palermo et 
al., (6) studied the relationship between anxiety 
and performance on a complex learning task and 
reported that the non-anxious subjects were supe- 
ior to the anxious subjects in every block of trials. 
The suggestion has been made in the Palermo et 
al., study that “the effect of increases in motiva- 
tion on performance depends upon the relative 
strength of the correct and incorrect responses 
aroused by the experimental situation. Calvin 
et al., (3) report по relationship between anxiety 
and intelligence, but do report that anxiety does 
significantly inhibit performance. Sarasen (7) in 
a study of the effects of anxiety on two kinds of 
failure on social learning reported no significant 
difference bet ween the control and experimental 
groups. Yet Flanders (5) concludes that “student 
behavior associated with interper sonal anxiety 
takes priority over behavior oriented toward the 
achievement problem." Ausubel et al., (1) re- 
porting on a study dealing with anxiety and the 
learning process suggest that “һе low anxiety 
group was significantly superior to the high pom 
iety group on the first trial of the maze, pata 
Superiority was not maintained over the course 0 


*This study was made possible by а тез 
Research, Contract No. NONR 474 (8). 


earch grant from the 


ten trials. ” 

The above studies suggest that there - 
tors other than anxiety involved т вахони а 
anxiety is a significant influence. In other words 
anxiety per se is not necessarily the sole deter- 
mining factor, but, rather, it becomes a signifi- 
cant influence under certaincircumstances. It was 
the purpose of this study to investigate the rela- 
tive influence of anxiety when groups are arranged 
cooperatively and non-cooperatively. 


Procedure 


Twenty-four university students were given the 
Taylor Anxiety Scale. The subjects were then 
assigned to either a cooperative or non-coopera- 
tive experimental group. The basic difference be- 
tween the two groups being in the free exchange of 
ideas and information on the part of the coopera- 
tive group. No deliber ate communication was 
permitted for the non-cooperative group. 

The experimental apparatus [described in de- 
tail elsewhere (2)] consisted of ten colored lights 
arranged on a panel. The problem involved the 
subjects’ prediction of which light would come on 
next. The light series were predetermined and 
automatically controlled. The total experimental 
situation consisted of three problems; the first of 
which might be consider ed a ‘‘warm-up”’ task. 
Problem two consisted of a simple numerical so- 
lution (1, 3, 5, 7, 9); problem three required a 
place-color solution (red-red; green-green; etc.,) 
and involved a shift from a numerical to a color 
set. In terms of solution time and problem-solv- 
ing efficiency, problem three was approximated 
fifty percent more difficult than problem two. 


Results 


Table 1 summarizes thedata in terms of prob- 
lem-solving time and anxiety. Correlations 
were run between problem-solving time and anx- 
iety for problems two and three for cooperative 
and non-cooperative groups. Table I suggests 
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TABLE I 


CORRELATION BETWEEN PROBLEM-SOLVING TIME 
AND ANXIETY 


Non-Cooperative (P 2) 


Problem-Solving Time Anxiety г = .06 


Cooperative (P 2) 


Problem-Solving Time Anxiety г = .10 


р = .448 
Non-Cooperative (р 3) 
Problem-Solving Time Anxiety г = .04 
Cooperative (Р 3) 
Problem-Solving Time Anxiety г = .55 
р = . 047 


P 2 = Problem two 


P 3 = Problem three 
P = probability of derived correlation coefficient 


TABLE II 


CORRELATION BETWEEN PROBLEM-SOLVING EFFICIENCY 
AND ANXIETY 


Non-Cooperative (P 2) 


Problem-Solving Efficiency Anxiety r = - 35 
Cooperative (P 2) 
Problem-Solving Efficiency 


Anxiety r = -.45 


р = .352 
Коп-Соорегануе (Р 3) 
Р 


Foblem-Sol ving Efficiency Anxiety r = 01 
Cooperative (P 3) 
Problem-Solving Efficiency Anxiety r = -.48 


р = .068 
Р 2 = Problem two 
Р 3 = Problem three 


р = probability of derived Correlation Coefficient 
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that there was very little correlation (. 06 and .10) 
between problem-solving time for either the non- 
cooperative or cooperative groups on problem two. 
Since problem two can be considered to be an 
‘feasy’’ problem, the results are consistent with 
the literature on thesubject. However, our inter- 
est in this study was on the relative influence of 
anxiety between cooperative and non-cooperative 
groups. On the basis of this study no difference 
can be assumed (p - .448). 


For problem three, which was fifty percent 
more difficult than problem two, an interesting 
discrepancy exists between the two groups (r = .04 
for non-cooperative groups; г = . 55 for coopera- 
tive groups, р = .047). One possible explanation 
of the discrepancy between the two groups on prob- 
lem three might involve the thinking of the Flan- 
der's article (5). Thesuggestion that the non- 
cooperative groups were problem (ac hievement) 
centered, whereas the cooperative groups were 
interpersonal centered, the ass umption is that 
anxiety interferes more when interpersonal rela- 
tions are involved. This is speculation, of course, 
and needs to be empirically checked out. 


Table П summarizes the data dealing with the 
relationship between p roblem-solving efficiency 
and anxiety. Efficiency in this case is defined as 
the ratio of correct to total responses. 

Again, the correlations were run for problems 
two and three for cooperative and non-cooperative 


groups. 


Reference to Table П suggests some interfer- 
ence (г = -. 35 and -. 45) by anxiety with problem- 
solving efficiency. However, the relative influ- 
ence does по seem to differ significantly between 
the two groups (p = - 352). (Reference might be 
made to the relative influence of anxiety on prob- 
lem-solving efficiency and problem-solving time 


for problem two.) 

For problem three à difference again emerges 
(p = .068) between the two groups. Attention is 
called to the decrease of anxiety influence, for the 
non-cooperative groups, from problem one (easy) 
to problem two (hard). Again, any attempt to ex- 
plain the discrepancies between the two groups is 
in the realm of conjecture, yet attention should at 
least be called to other studies which might 1 üre 
nish a plausible explanation. A combination of 
the suggestions made by the Flanders (5) study 
and the Ausubel (1) study that the achievement 
centered subjects are less influenced by anxiety 
than are interpersonal relations centered subjects 
(Flanders), and thatthe discrepancy between шин 
and low anxiety subjects does not hold up m 
time (Ausubel) might account for the findings 0 
Tables I and П. Certainly some follow-uP of these 


Suggestions is indicated. 


Summary 


An experimental study was made, using twenty- 
four university students, onthe relative difference 
regarding the influence of anxiety on problem- 
solving between subjects assigned to cooperative 
and non-cooperative groups. Correlations were 
made between problem-solving time and anxiety 
and between problem-s 01 ving efficiency and anx- 
iety. Tests were made regarding the significance 
of the differences between cooperative and non- 
cooperative groups. 

The results suggest a minimal influence (r= 
ше ын 10) of anxiety on problem-solving time 

y problem for both cooperative and 
non-cooperative groups (p = .448). However, anx- 
iety seemed to have a more pronounced influence 
on the cooperative group than on the non-cooper- 
ative groups for the “һагФ” problem (г = .04 and 
.55; р = .047). 

In terms of efficiency, anxiety seemed to have 
about the same influence for both groups (г=.35 
and -.45; р = .352) on the “еаву” problem. 

However, on the ‘shard’? problem, anxiety 
seemed to be more influential with the cooperative 
group than with the non-coope rative group in 
terms of influencing efficiency (г = .01 and -. 48; 
p = . 068). 

At most, the results repo rted іп this paper 
should be considered as exploratory and tentative. 
Certainly much additional work needs to be done 
in this area. A word of caution should be added 
also regarding the technique of measuring efficien- 
cy by a gross device such as right responses to 
total responses. Presumably, one could get a 
better measurement of anxiety, if a greater dis- 
parity existed between high andlow anxiety scores. 
The present range (6 to 32, M= 13), with possi- 
bly too many scores around the mean, may not 
have been sensitive enough to discriminate between 
high and low anxiety. 

Again, it should be emphasized that more work 
needs to be done before anything beyond sugges- 


tion can be inferred. 
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I. Introduction 


THIS STUDY had its origin in the studies con- 
ducted by the Bureau of Indian Affairs and the Un- 
iversity of Kansas during the period 1951-55 (3, 
13). This Indian Education Res earch Project is 
undoubtedly the most comprehensive study of In- 
dian and white children which has been made. The 
program explored the educational status of Indian 
children and emphasized two functions: predictive 
testing and achievement ‘testing. The purpose of 
the predictive testing was an attempt to investi- 
gate high-school graduates’ academic skills and 
to predict their probable success if they consider 
further formal education. The achievement test- 
ing was designed to serve two purposes: (a) a con- 
tinuing evaluation of the s tatus of educational 
achievement of Indian children for administrative 
use, that is, the recording and interpreting of test 
scores for describing a pupil's educational 
achievement as measured by tests, and (b) the use 
of data obtained as a functional part of the instruc- 
tional program. These studies showed that Indian 
children differed from white children as measured 
by achievement test results and that the smaller 
the proportion of full - plooded and non-English 
speaking children in a specific group of Indian 
children the higher the group achieved. It was 
recognized that these differences in educational 
achievement depended on differences in ехрепеп- 
tial, environmental, and cultural factors and not 
necessarily on innate ability alone. | 

The present study was an attempt to isolate and 
compare the factor structures and sets of factor 
patterns obtained from the achievement test scores 
for full-blooded Indian children and white children. 
The purpose of this study was to investigate the 
degree of relationship between the subtests of the 
measuring instrument, to determine the number 
of independent factors which account for the scores 
On the subtests, how heavily each of the subtests 
is weighted with the factors, and which factors 
appear as a whole for the different groups of sub- 
jects. A comparison of factors by sex, grade 


a. 


*All footnotes will be found at end of article. 


level, and race was made. 

Although there has been agrea 
in the literature which co хилээ ш 
ا‎ ae achievement of American Indians, no 
че spe мэ evident in which factor analysis had 

pplied to achievement test scores for the 

purpose of comparing full-blooded Indian and 
white children onlarge samples. Perhaps the only 
attempt was made by Harris (33) in which corre- 
lations were obtained between 15 language skill 
variables in four groups of fifth-grade Indian stu- 
dents. The intercorrelations were too irregular 
for any definite factors to emerge. Experimental 
studies on intelligence with Indians (20, 26, 27, 28, 
29, 30, 34, 36, 39, 46) indicate that differences in 
test performance between Indian and white chil- 
dren are not necessarily due to differences in in- 
nate capacity. The evidence indicates that white 
children and mixed-blooded Indian children did 
better where language was vital. The results 
showed that Indians scored below age and that {е- 
males performed better than males on the intelli- 
gence tests used, but these results seem meager 
and inconclusive. 

Anthropological studies (37, 39, 40, 41) indicat- 
ed that there were no significant differences in 
results of performance intelligence tests for Indi- 
an children as compared to white children. Intel- 
igence tests of a verbal nature, requiring writing, 
resulted in lower 16/5 for Indian children. The 
educational achieveme nt of Indians in public 
schools, federal schools, and mission schools as 
measured by test scores resulted in a hierarchi- 
cal order, respectively (2,13,42,43,52). Indian 
children with highest degree of Indian blood and 
non-English speaking ability at the time of enter- 
ing school obtainedthelowest test results. It was 
believed that cultural factors played an important 
role in the learning process of these children. 

Factor analytic studies on intelligence and age 
with white children (1,4, 5, 7, 9, 14, 17, 19, 23, 24, 
25, 38, 44, 45, 41,48) emphasize d that a general 
factor was established early in life. However, 
conflicting evidence existed as to the first factor 
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i ith increasing age. Where factor pat- 
шоо аа in there was no consistent trend in 
evidence. Factor studies regarding levels and sex 
for white children (35, 49, 50, 51) revealed a gen- 
eral agreement as to no differentiation in factor 
patterns between boys and girls. There was indi- 
cation of conflicting evidence regarding the theory 
of the general factor playing a less important role 
as special abilities develop. The factor patterns 
change relatively little from grade level to grade 
level, and where changes did occur the patterns 
did not exhibit any systematic trend. Mental fac- 
tors seemed to depend on background, educati 
and experience of the subjects involved. 

Where educational skills for white children 
were factor analyzed (6,8, 10, 11,12 


on, 


age, and 5) bi 
educational Skills, it iffi 
component factors. 


Hypotheses 


nia Achievement Test Batteries, Th 
are: 


3. The factor loadi ill be 0008 by SR 
nitude for all groups, аша шарк. 


1 i ture 
4. Higher factor loadings of a verbal na 
will appear for white children than for full-blooded 
Indian children. . 
5. Higher factor loadings of a numerical пена 
will appear for white children than for full-bloode 
Indian children. 


6. А common factor will appear throughout all 
the subtests for all groups. 


П. Procedure 
ee 


The Testing Program 


A fall achievemen 


t testing program was con- 
ducted by the United 5 


tates Bureau of Indian Af- 
fairs and the University of Kansas during the per- 
tod 1951-55. The California Achieve ment Test 
Battery was used throu 
gram for grad 
dren attending 


kogee areas in 1954. The tests were administered 
by Indian Bureau and 


Personnel in th 


Proportion of 
were sent to 
tion, necessa 


All answer sheets 
Kansas for inspec- 
andscoring. The raw 
т each individual were 
я These were punc hed on 
with back ground information for each 


recorded and tabulated, 
IBM cards 
child. 


The Population 
а=) 


О graphical locations of the administra- 


ed States Bureau of Indian 
estin 


The ge 
tive areas 


» 4) Billings area — Montana 
yoming, 5) Muskogee өлен —eastern Okla- 
апа 6) Anadarko Е. 

Kansas. Between the ye 

1951 and 1954, inclusive, 13 636 Indian children 

White Children Were tested in more 
ived in ОП Schools. These 23, 608 с е 
dren lived In rural areas either on farms, ranches» 
5, or small communities. 
attending ET Bate of full-blo od a Indian childre? 
in = federal Schools and white children pet 
cix Public schools Combined in all areas jer 
jects, 10^, this study, The total number of 888 
Indian 52:88, were separat 10 as to full-bloo" 

children and White children by зех 


ississippi 
аћота апа 
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TABLE 1 


DISTRIBUTION OF SUBJECTS BY RACE, SEX, AND 
GRADE LEVELS 


Full-Blood Indian Children 


Elementary Level 
Grades |, 5, 6 


Intermediate Level 
Grades 7, 8, 9 


Advanced Level 
Grades 10, 11, 12 


Elementary Level 
Grades №, 5» 6 


Intermediate Level 
Grades 7, 8, 9 


Advanced Level 
Grades 10, 11, 12 


Total = 14,888 
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orm to the suggested grade 
Deme aed EN at evement tests, the grade 
4 were combined into elementary (grades 5 
5, and 6), intermediate (grades 7,8, dod ын 
advanced (grades 10,11, and 12). TableIs Ow: 
the breakdown of the subjects and the number in 
each group with percentages, The app roximate 
age ranges for the federal Indian children were 
from eight to 17 years for the elem entary level, 
10 to 19 years for the intermediate level, and 15 
to 24 years for the advanced level, For public 


age ranges 
he elemen- 


The Test Battery 


The California Achiey ement Test Battery, 
Form AA, was utilized to measure the Subjects’ 
achievement in fundamental readi 


were: a) reading Vocabulary, р) 
hension, с) 


fundamental b 


The Factorial Analyses 
= Analyses 


The raw scores 
on IBM cards. Tabul 


ions 
which indicated that the tests cou] d hay 
چ‎ MA Я 
Somewhat too difficult for the s te a 
However, the degree of Sk e wness was not suff; 
ciently large where the assumption 


of normalit: 
could not be met. A rather Consistent trend на: 


noted in Ше variability among races, неса, жне 
grade level groups. In general, the standar i 
viation is considerablylargeron the emi bes 
cabulary, and Arithmetic fundamentals tes E 
on the other four tests in the battery for а 
ious groups. Pearson product-moment ae 
tion coefficients were det ermined among the s 
tests by race, sex, andgrade level. Twelve m 
relation matrices for the various groups wer 
formulated, i.e., а Separate correlation matrix 
was constructed for males andfemales at the шин 
mentary, intermediate, and advanced grade шше, 
groups for both full-blooded Indian children ani 
white children. The correlations were all positive 
with the lowest correlation of r = . 35 between the 
Reading vocabulary test and the Arithmetic fund- 
amentals test at the advanced level for full-blood- 
For the sample size N = 551 
this correlation differs from zero at the .01 level 


the communalities, t 
Were used in the diag 


This metho 


OPriate since a central refer- 
ence vector existed oj 1 


› therefore i races coul 
not be drawn wh; would ар сэлт 
Васка of these factors. The € 

68141415, range -:0540.04, for full-bloo¢ 
ed Indian male 2 hildren at all levels are suffi - 
Clently smal] to yield а Meaningful third factor: 
the thirg factor loadings for this group 

iduals, aresult, only poe 
in obli ion for 

three gr ade level in ыны plane, #07 

was made. Tables УШ, : 
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TABLE II 


INTERCORRELATIONS BETWEEN THE RAW SCORES OF EACH TEST 
FOR FULL-BLOOD INDIAN CHILDREN BY SEX ON THE 
CALIFORNIA ACHIEVEMENT BATTERY 


Full-Blood Indian Children 


3, Arithmetic 


Reading Comprehension, 
5. Language, 


2qest.--l. Reading Vocabulary; 2. 


Reasoning, Le Arithmetic Fundamentals, 


6. Spelling 
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TABLE Ш 


INTERCORRELATIONS BETWEEN THE RAW SCORES OF EACH TEST 
FOR WHITE CHILDREN BY SEX ON THE CALIFORNIA 
ACHIEVEMENT BATTERY 


White Children 


Inter- 
mediate 


Note. --Decimals omitted for convenience. 


? Test. --1. Reading Vocabulary, 2: Reading Comprehension, 3. Arithmetic 
Reasoning, lı. Arithmetic Fundamentals, БЕ Language, 
6. Spelling 


— 
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TABLE IV 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 
THE CENTROID FACTORS (Fo) 
(MALE) 


Full-Blood Indian Children 


Mal 


е 
1 "Гг 
01 


үтү T a 


олы n Hourn 


jmals omitted for convenience. 
2. Reading Comprehension, 
ndamentals, 5. Language, 


3. Arith- 


Note, --Dec 


атеві.--1. Reading Vocabulary; 


metic Reasoning, ц. Arithmetic Fui 


6. Spelling 
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TABLE V 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 
THE CENTROID FACTORS (Fo) 
(FEMALE) 


Full-Blood Indian Children 


Pe Er p pM 


Qut Ev m ыјол\л r7 го How mw m H 


Note.--Decimals omitted for convenience, 


8Тез%.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arith- 


metic Reasoning, l. Arithmetic Fundamentals, 5. 
6. Spelling 


Language 


RUPIPER ЕЕ 


TABLE VI 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 


THE CENTROID FACTORS (Fo) 
(MALE) 


White Children 


ed for conveniences 
Е Comprehension, 


5. Languages 


Note. --Decimals omitt 
abularys 2, Readin 3. Arith- 


атов. --1. Reading Тос 
he Arithmetic Fun 


Е damentals 
metic Reasonings я 


% Spelling: 
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TABLE УП 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 
THE CENTROID FACTORS (Fo) 
(FEMALE) 


White Children 


actor Loadi 


ГІ ТІ | 
2 181 


5 


Advanced 


AWE m HAUL m HÎ ain Fw m H| 


Note.--Decimals omitted for convenience, 


8 ї685,--2, Reading Vocabulary, 2, Reading Comprehension, 3. Arith- 


metic Reasoning, |. Arithmetic Fundamentals, 5. Language, 


6. Spelling 


m — 5220 
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TABLE VIII 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, MALES, ELEMENTARY 
GRADE LEVEL GROUP 


——— pasouse 
Unrotated Factor Rotated Oblique Factor 
Matrix V 


Matrix Fo 


Matrix S = Le Direction Matrix. Direction Cosines 
Coefficients of а Normal of Reference Vectors Np 
to Each Coordinate Plane 


Matrix 6 LANA. Совдпев of 
Matrix Eo 


Angles Between Reference 
Vectors АРЧ 


3. Arith- 


mprehension و‎ 


amentals, 5. Languages 


Reading Vocabulary» 2. Reading Со 


ђе Arithmetic Fund: 


aTest:--l° 


metic Reasoning; 


6. Spelling 
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TABLE ІХ 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, MALES, INTERMEDIATE 
GRADE LEVEL GROUP 


ЕЕ —— 
Unrotated Factor Rotated Oblique Factor 
Matrix Fy Matrix V 


Matrix S = Le Direction Matrix A. Direction Cosines 
Coefficients of a Могта1 of Reference Vectors Ap 
to Each Coordinate Plane 


АЕ 

Matrix С + ЛА. Cosines of 

Angles Between Reference Matrix Eo 
Vectors A ра 


?Test.--1. Reading Vocabulary, 2, 


Reading Comprehension, 3. Arith- 


metic Reasoning, |, Arithmetic Fundamentals, 5. Language, 


6. Spelling 


— 
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TABLE Х 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, MALES, ADVANCED 
GRADE LEVEL GROUP 


—— —— 
Unrotated Factor Rotated Oblique Factor 
Matrix Fo Matrix V 


Matrix Л. Direction Cosines 


Matrix S = Le Direction 
Coefficients of а Normal of Reference Vectors Р 
to Each Coordinate Plane 
Facto А Б 
„917 | «9М. 
Extended Factor 
Matrix Ёо 


ctors Л РА 


abularys 2, Reading Comprehension, 3. Arith- 


ће Arithmetic Fu 5, Languages 


--1. Reading Voc 


Test. 
ndamentalse 


metic Reasoning» 


6. Spelling 
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mm я‏ پپپ 
Rotated Oblique Factor‏ 
Matrix V‏ 


Extended Factor 
Matrix Eo 


Matrix S = L. Direction у Matrix Л. Direction Cosines 
Coefficients of а Могла] 


of Reference Vectors Ap 
to Each Coordinate Plane 


SS ————— 
Matrix С «ЛА. Cosines ог 


Matrix Т. Direction Со- 
Angles Between Reference Sines of Primary 
Vectors Л Pq Vectors T 


Р 


aTest.--1, Reading Vocabulary, 


2. Reading Comprehension, 3. Arith- Matrix TT! = Ера, Cosines 
of Angles Between Primary 
metic Reasoning, |. Arithmetic Fun- Vectors Tp? 


damentals, 5. Language, 6, Spelling 


PFactor A should read Factor С 
Factor С should read Factor A 
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TABLE ХП 


TATION OF AXES FOR FULL-BLOOD 


MATRICES OF OBLIQUE RO 
FEMALES, INTERMEDIATE 


INDIAN CHILDREN, 
GRADE LEVEL GROUP 


| کے‎ ———————— 
Extended Factor Rotated Oblique Factor 
Matrix Eo Matrix V 


Matrix 5 = Le Direction Matrix ^ . Direction Cosines 
Coefficients of а Normal ` of Reference Vectors /\P 
dinate Plane 


to Each Coor 


Matrix 0 „АЛ. Cosines of ти E onm Co- 
Angles Between pcc Сена s 


Vectors 


аТев%.--1» Reading Vocabulary» === 

Matrix TT! = Вра. Cosines 

rehension, 3. Arith- a T елден а тана 
Vectors Тр 


2. Reading Comp 
m Arithmetic Fun- 


metic Reasoning» 
6. Spelling 


damentals, 5» Languages 


192 JOURNAL OF EXPERIMENTAL EDUCATION 


GRADE LEVEL GROUP 


nro dl c 


Extended Factor Rotated Oblique Factor 
Matrix E, Matrix V 


Matrix S = І, Direction Matrix Л. 
Coefficients of a Normal 
to Each Coordinate Plane 


Direction Cosines 
of Reference Vectors Л р 


جس 


Matrix Олед ‚ сыз. of 


Matrix T, Direction Со- 
Angles Between Reference sines of Primary 
Vectors A pq Vectors Tp 


*Test.--1, Reading Vocabulary, ЦИК ------ 


Matrix TT! « Кра. Cosines 
2. Reading Comprehension, 3. Arith- of Angles Between Primary 


Vectors Tp 
metic Reasoning, |, Arithmetic Fun- 


damentals, 5. Language, 6. Spelling 


PFactor A should read Factor 0 
Factor C should Tead Factor А 
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TABLE XIV 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
CHILDREN, MALES, ELEMENTARY 
GRADE LEVEL GROUP 


РЕ 


————— 
Rotated Oblique Factor 


Extended Factor 
Matrix Eo Matrix V 


Matrix S = L. Direction Matrix Л . Direction Cosines 
Coefficients of а Normal of Reference Vectors ^p 
to Each Coordinate Plene 


Matrix 7. Direction Co- 


Matrix С = NA. Соѕіпев of с 
Angles Between Reference sines of Primary 
Vectors /\pa Vectors Tp 


Reading Vocabulary; 


атев%.--1. 
; доп, 3. Arith- Matrix ТТ! = Ера. Cosines 
2. Reading Сотргећепзіоп» 3 of Angles Between Primary 
Vectors Tp 


metic Reasoning, lı. Arithmetic Fun- 


damentals, 5. Languages 6. Spelling 
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TABLE Xv 


MATRICES OF OBLIQUE RO 


TATION OF AXES FOR WHITE 


CHILDREN, MALES, INTERMEDIATE 
GRADE LEVEL GROUP 


жни 
Extended Factor 
Matrix Ej 


Matrix S = L. Direction 
Coefficients of a Normal 
to Each Coordinate Plane 


<----------- 
Matrix С = АЛ. Cosines of 
Angles Between Reference 
Vectors Apq 


ЗТезћ,--1, Reading Vocabulary, 
2. Reading Comprehension, 3. Arith- 
metic Reasoning, |. Arithmetic Fun- 
damentals, 5. Language, 6, Spelling 


پپپ 
Rotated Oblique Factor‏ 
Matrix V‏ 


Matrix A. Direction Cosines 
of Reference Vectors Ap 


ЬЬ 
Matrix 7, Direction Со- 


зіпез оѓ Ргілагу 
Vectors Тр 


س 
Matrix TT! = p • Cosines‏ 


of Angles Between Primary 
Vectors Тр 


— 


MATRICES 
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TABLE XVI 


OF OBLIQUE Е: 
CHILDREN, 


OTATION OF AXES FOR WHITE 
MALES, ADVANCED 


GRADE LEVEL GROUP 


سے 
Extended Factor‏ 
Matrix Eo‏ 


Matrix S = Le 


Coefficients 9 
dinate Plane 


to Each Coor' 


Test. --1• 


2. Reading Солргепепзі! 
ke Arithmetic Fun- 


metic Reasoning» 
6. Spelling 


5. Languages 


damentals, 


ony 3• 


Reading Vocabulary» 
arith- of Angles 


———— 
Rotated Oblique Factor 
Matrix Ч 


Matrix Л. Direction Cosines 
of Reference Vectors Ap 


Matrix T. Direction Co- 
sines of Primary 
Vectors Tp 


Matrix TI! = Ера. Cosines 
Between Primary 
Vectors Tp 
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TABLE XVII 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE | 
CHILDREN, FEMALES, ELEMENTARY 
GRADE LEVEL GROUP 


Extended Factor Rotated Oblique Factor 
Matrix E, 


Matrix V 


Matrix A, Direction Cosines 
of Reference Vectors Ap 


Matrix Т, Direction Co- 


Sines of Frimary 
Vectors Тр 


estesi, Reading Vocabulary, 


2. Reading Comprehension, 3. Arith. 


metic Reasoning, ША Arithmetic Ғап- 
Language, 


bp, 
actor 4 should read F, 
Factor C should read ма 1 


damentalg, 5, 


6. Spelling 
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TABLE ХУШ 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
с MALES, INTERMEDIATE 


HILDREN, FE 


GRADE LEVEL GROUP 


—————À 


Extended Factor 
E 


Matrix S = Le Direction 
Coefficients of a Normal 
to Each Coordinate Plane 


Matrix © «АА. Cosines of 
Angles Between Reference 
Vectors Apa 


атезь. --1• Reading Vocabulary, 


2, Reading Comprehension, 3. Arith- 


metic Reasoning, lı. Arithmetic Fun- 


damentals, 5. Language, 6. Spelling 


————— 
Rotated Oblique Factor 
Matrix V 


Matrix A . Direction Cosines 
of Reference Vectors Р 


Matrix T. Direction бо- 
sines of Primary 
Vectors Tp 


Matrix TT! = Ера, Cosines 
of Angles Between Primary 
Vectors Tp 
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TABLE ХІХ 


MATRICES OF OBLIQUE ROTATION OF АХЕ 
С 


5 FOR WHITE 
N, FEMALES, ADVANCED 
GRADE LEVEL GROUP 


Extended Factor 
Matrix E, 


Direction Cosines 
of Reference Vectors Ap 


ЛА. Cosines of 


Reference 
ectors Лра 


Should reag Factor С 
d 


read Facto» A 
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i unrotated factor loadings, the ro- 
ат diis pee loadings, the direction coeffi- 
cients of the normal to each co- or dinate plane, 

transformation matrix, and the matrix of co- 
е of the angular Separations of the Oblique 
ор full-bl ooded Indian children at the ele- 
па intermediate, and advanced levels, re- 
spectively. 


were not included in the pattern plan. The ele- 
mentary and advanced grade level groups for fe- 


groups, were interchanged by post-multiplying by 
the permutation matrix, Correspondi 
terchange in Matrix В, q Was obtained by pre-mul- 
tiplying and post-multi] 


co-ordinate plane, Matrix S, when normalized 


tated factor matrix by the transformation matrix 
yielded the rotated oblique factor Matrix у, The 


r A to a Factor B 

loading at the intermediate and advanced levels, 

A and » factors A and C, and factors В and C While Factor A vanished оп Test 6 at the advanced 

е cosines between the reference vectors are level For full-blooded Indian female children a 

Shown in Matrix C of Tables ҮШ through XIX, and rather similar pattern existeq among the three 

the cosines between the primary factors are shown grade leve] groups. The “Staircase effect" was 
in Matrix Rpq of Tables XI through 5 e in- Present at al] levels with 

tercorrelations were 


the eleme ntary group 
Оп Test 5 and no sig- 


* A similar pattern was 
noted at the intermediate lev 


i tors, 
These correlations were all signifi 
ent from zero, i 
factors were not independent and 
Ship existed among the m. The hi Бтежег pat- 
1 У Уав Present, In relation to sex, 
dren, but Сауе were similar for white chil- 


Wed more inconsistenc for the 
1-bloodeq Indian Children, 4 


all cases, Tests 3 and 4 had Signific 
of Factor B which was also Signific 
to the other two factors, Th 
dicated the ргезе nce ofa Order factor 
common to all the tests, 

In order to facili 
groups, factor patterns of the rotated 
tor loadings on each of the six tests a. 
ed in Table ХХ, Factor loadings of 1 


: Sed in this Study were all 
d-penci] Variety, and since the tests pur- 
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desirable, but restrictive because no other meas- 
urements were available for the subjects studied. 

In view of the tests’ content, Factors А and С 
were identified as verbal factors and Factor B 
was identified as а numerical factor with a com- 
mon factor underlying all the tests, which might 
be identified as а SC hooling factor. It would be 
inappropriate to attempt an interpretation of the 
abilities necessary to Pe rform the tasks called 
for by the various tests. The tests with more 
verbal material, Reading vocabulary, Reading 
comprehension, Language, and Spelling did not 
remain clustered a$ well as did those tests with 
number mate rial, Arithmetic reasoning, and 
Arithmetic fundamentals. 


Evaluation of Hypotheses” 


Hypothesis 1-А comparison between the mean 
raw scores computed by the critical ratio showed 
a significant difference, .05 le 
on all tests for the three grade level groups in 
favor of white children. with regard t 


ferences, the mean raw scores 
gnificantly higher, .05 level of 


confidence, than for male white children with one 


exception: At the advanced gra 
surpassed fe 
test. In the maj 
scores were Sig 
fidence, for full-blooded Indian female с i 
The mean differences in favor of full-blooded In- 
dian male children were on the Reading compre- 
hension and Arithmetic reasoning tests at the el- 
ementary level, Reading У ocabulary, Reading 
comprehension, 
the intermediate level, and Reading voc abulary 
and Reading comprehension at the advanced level. 

Hypothesis 2— Two factors were found for full- 
plooded Indian male children at the elementary, 
intermediate, and advanced grade levels. These 
were identified as averbal anda numerical factor. 
relations between these factors sug- 
nd-order general factor. Three fac- 


ority of cases the mean raw 


children and for male an 
e level groups. 


at each of the three grad : 
is 3— The factor loadings ranged from 


“te children. More high 
eared for full-blooded Indian male 


ing tests- А 
шз eth i 5— Full-blooded Indian male chil- 
d higher loadings of а number factor 


on the arithmetic tests than did white male chil- 
dren. Generally, the size of the loadings of this 
factor increased with increase in grade level. 
For full-blooded Indian female children the load- 
ings were highest at the intermediate level and 
lowest at the elementary level, whereas, for the 
white female children the loadings were highest at 
the advanced level and approximately of the same 
magnitude at the elementary and intermedi ate 
level with moderate loadings- 

Hypothesis 6—The correlation matrices for all 
groups analyzed showed highly significant positive 
relationships. This suggested an underlying com- 
mon factor. The common factor was substantiat- 
ed by the significant intercorrelations existing be- 
tween the reference and the primary vectors. 


Iv. Discussion 


There is little evidence г e garding differences 
in sets of factor patterns and factor structures by 
race and sex for the eleme ntary, intermediate, 
and advanc ed grade level groups. Although the 
results did not conform ideally to Garrett's (25) 
differential hypothesis W hereby abs i 
hanges in organization from a general abil- 


ity to anorganized group of abilities or factors 25 
rithmetic tests showed rather 


the grade 1 
vanced level. A similar t 


was based on cross-sectional rather 
jtudinal data. When the correlations be- 


tween the reference vectors and the primary vec- 
sidered, in most cases the correla- 


ased with inc rease in grade level. 
relations were significantly different 


it should be pointed out that with only six variables 
the reference frames would not be too stable, 
therefore, the factor loadings might be subject to 
change from sample to sample. It was difficult to 
adequately establish the traces on the test projec- 
tions with only six tests andcrite rion tests with 
established factors for the subjects in this study 
were not available. 

The high positive correlations among the tests 
and among the primary vectors strongly suggest- 
ed the presence of a factor in the second-order 
domain. Apparently, this accounted for the low 
loadings on the first-order factors. It may be аз- 
sumed that the tests are rather complex in that 
they not only involvea specific factor, put also 
the common factor defined by the tests. Tests 1, 
2,5, and 6 appeared to be more complex than 
Tests 3 апа 4. The reading and language tests in- 
volved two factors in addition to the common fac- 


rr‏ للل 


i i t to 
чсаНопај Skills Since these Skills are Importan 
aces i 


: w- 
both r tside an educationa] s Hace LA 
ever, th WO races do ubtedy place h they 
Values on these fundamentar Skills even n of 
both f ion ino егісап Society. Beca imd 
а differen, егасНоћ between the — 
Vidual’s чсан vironment and his сш 
i 


Seems reasonable to assume that ue 
Children аге по аз Cognizant Of the ultimate duis 
for these i ducationay Skills, Other than с Ре 

егепс 8, slow Work habits, the т 
the test ite 5, poor motivation, level of aspir 
tion, and lac i t 


of interes тау а150 account for 
Some of the qj feren untereg. е ques- 
i S to wheth tests were equal- 
ly fair to übted]y. these сола 
tions Could h fected th result 1S study 


i і апзаз ducted а fall 
en esting rogram f an and white 
Childre. t the ej mentary, intermediate and ad- 
rade levels for the Uniteg ев Bureau 
Affairg ring the Period 195 -55. Th 
ia Achieve nt Test tterieg Were used 
П Scores ducationa Chi ent in the 
basics ill rea Ing voca ry, readin; 
Tehensig | агїйшен Teasop; » arithmetic 
11215, е, ang Pelling, en the 
Ores of tests у Pared as to 
"писал differe er d in favor 
it Children 
ample у, 5 limiteg all ull-bloodeg 1 
Children and White Chil en hom a сот- 
t of test Scores Was avai] le, е distri 
f 14, 888 Subjects used i this Study 
91 5502 p ~bloodeg In hildren ang 
е Childre е Вёоргар loc ations 
nistr tive аһ as of nite States 
ian airs Which testing was 
еге; 1 hoenix, 2) b Ччега » 3) 
» 4 Billin Musko ее, and 6) Ana- 
re ісі. поели “°F relations between 
еге с Puted for ene various Вгайе ley- 
by rac Sex. The twelye intercor- 
tric ere factor 
епітоід т 


Nalyzeq by Thur- 
actors were sub- 

У means of extended 
With | e 


e Concept of simple 

Tee factors Tesulteg, ith oblique 
hirg factor Was not Significant for 
dian e Children, Or relations 
Sfer Vectors а d the Primary 

re btaineg actors C were 
Tbal f Ors an, Factor B as a nu- 
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merical factor with highly significant correlations 
between the factors. This indicated a second- 
order factor common to all tests. 

The purpose of this study was to compare the 
sets of factor patterns and factor structures by 
race, sex, and grade levels. Conclusions from 
the factor analyses follow: 


1. Two factors were found for full-blooded In- , 


dian male children at each of the three grade lev- 
el groups, while three factors were found at each 
grade level for full-blooded Indian female children 
and for male and female white children. 

2. Higher factor loadings were found for full- 
blooded Indian male children than for any other 
race-sex group. The factor loadings rangedírom 
.20 to .65 for full-blooded Indian children and from 
.21 to .57 for white children. 

3. In most cases the facto: 
al nature were higher for 
male children. 

, 4. In general, 
ical nature were hig! 
children except on 
where white female children at t 


had the highest loading. 
5, The sets of factor patterns for all groups 


were more similar than they were different. 
6. Acommon factor of the second-order do- 
main was evident throughout all groups. This 


factor is probably Schooling. 


r loadings of а verb- 
full-blooded Indian 


the factor loadings of а numer- 
ner for full - p1 ooded Indian 
the arithmetic reasoning test 
he advanced level 


FOOTNOTES 


* This monograph is based on the author’s dis- 
sertation completed at the University of Kansas. 
The data used in this study were obtained from 
the Indian Education Res earch Project conduct- 
ed by the United States Bureau of Indian Affairs 
and the University of Kansas during the period 
1951-55. Mr. L. M. Coombs, Education Spe- 
cialist, United States Bureauof Indian Affairs, 
offered suggestions which were essential to the 
study. Dr. K. E. Anderson and Dr. A. H. 
Turney, Committee Members, read and eval- 
uated the manuscript. pr. R. N. Bradt, De- 
partment of Mathematics, provided helpful ad- 
vice on the statistical and factorial analyses. 
Mr. C. Bradrick, Director of the Statistical 
Bureau, compiled the data for computation of 
the correlations. The author is indebted to 
these persons for their cooperation and assis- 

he author is deeply indebted to his 

r. E. Gordon Collister, not 


only for his excellent direction and advice, but 
also for his encouragement and inspiration. 


жж Now at the College of Education, University of 


Oklahoma. 


1. The writer is indebted tothe staff of the Statis- 
tical Bureau, University of Kansas, for carry- 
ing out the necessary machine operations for 
these factor analyses. 


2. The following scale was used for the evaluation 
of factor loadings: 
Insignificant loadings .00 to . 20 
Low loadings .20 to .30 
Moderate loadings .30 to . 50 


High loadings .50 and above 
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ATTITUDE OF KINDERGARTEN CHILDREN 
TOWARD SPECIALLY PREPARED 
STORY MATERIALS’ 


BILL SNIDER ** 
State University of lowa 


THE DEVELOPMENT of materials for helping subjects for this study. Class A consisted of 41 


the child become sensitive to the dynamics of be- children in the morning апа afternoon classes of 
havior has been one of the aims of the Preventive teacher A, and Class B of 61 pupils in the morn- 
the Child Wel- ing and afternoon classes of teacher B. 


Psychiatry Research Pro gram in 


fare Research Station at the State University of In the method of paired comparisons, the sub- 


resented a number of different stimuli in 


Iowa. Among the material prepared is а group of ject is p 

narratives designed to help the kindergarten child pairs and is asked to judge which is greater with 

develop a causal orientation toward his social en- respect to some characteristic, such as, which of 
es are to be an effective two objects is heavier, which is prettier, or which 


vironment. If these stori 
d to a caus- of two activities he prefers. In this particular in- 


e young chil 
vestigation each child was asked which of two kin- 


al way of thinkin it seems necess 

children should enjoy listening to them. The pri- dergarten activities he preferred. By statistical 

mary purpose of this study, therefore, was to analysis of paired compar isons it is possible to 
rten children liked to assign а 5 et of numbers to the different stimuli 


discover how well kinderga. 


listen to and discuss causally orien! the subjects have been asked to judge. | 


Six different kindergarten activities were stud- 


specially WF itten for their age. The statistical 
procedure chosento determine this was the meth- ied, namely: having music, working puzzles, hav- 
od of paired comparisons. а ing the teacher read ordinary or regular kinder- 
, The purpose of the causally oriented story is garten stories, playing house, painting at the easel, 
to introduce the child to the differences between a and having the teacher read the causally oriented 
non-causal or surface approach and a causal ap- primary stories. These kindergarten activities 
proach to human situations. In the narrative, à were combined in all possible pairs. Each child 
social situation is presented and someone begins was taken privately by 2 trained person! and asked 
to act in a non-causal мау. The problem is then which of each pair of activities he preferred. 
ceeds more The first few minutes with each child was spent 


re-examined and the individual pro 
fects of this type of ac- in establishing rapport and 


tion are considered The reading of the story is derstood the difference between the two types of 
a discussion of differences be- stories being used as stimuli. The regular kin- 

proaches and some of the probable dergarten stories, often read by the teacher, con- 
sisted of a pile of several books from the book ta- 


effects of each. 
ple. The causally orientedstories were designat- 
Description of the Investigation ed as the book without any pictures from which the 
teacher read to them; a copy of this book was on 


year 1954-55, a number of the table so that the child could recall it. A final 
in the Preventive Psychia- check was made by asking the child if he knew the 
sally oriented difference between the stories in this book without 
hers had tak- any pictures in itandthe stories here—pointing to 


During the school 


teachers participated 1 
try Program (5) by pres enting cau: 


materials to their classes 

en part in an intensified training program through the books from the table. The administrator di 4 

summer workshops set up to train them to ргас- not let the child see the books from the table other 
c approach The pupils than the stack as the child was to respond to the 


books from thelibrary table rather than to specific 


reparation of this paper was supported by the Grant Foundation. 
ve Psychiatry Research Program, Child Welfare Research Station. 


entary Supervisor for the Preventive Psychiatry Program, 1955-56 
, 


ж Тһер ! 
** Address: preventi 
1. Mrs. Helen Marie Clark, Elem 


administered the test. 


ии 
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i d, likewise, the book containing the 
2. опеке stori es was only shown to the 
child without permitting him to leaf through it. 

The presentation of the pairs of stimuli were 
presented in such a manner as to adhere to the fol- 
lowing principles as closely as possible (10, 11): 


1. Maintain the greatest possible spacing be- 


tween pairs involving any given member of 
the stimulus group. 


2. Present a stimulus equally often first and 
second. This is referred to as taking ac- 
count of “space errors, ” 


3. Present stimuli as пеаг] 
ner as is possible so as 
effects. 


У in the same man- 
to avoid Suggestion 


1. Which would yourather do, listen to a story 
from this book (the Causally oriented Stories) 
or listen to a story from this book (regular 


Stories)? (In each case only pointing to the 
book or books. ) 


2. Which would yourather do, 


l have music time 
when you Sing or work with 


а puzzle? 


3. Which would your 


ather do, Play house or 
paint at the easel? 


Analyses of the Data 
SSS 0: the Data 


The first Step in the analysis o 
the six kindergarten ac tivities 


II give the proportion matrices f, 
Classes A and B respectively, 


proportion. For example, in Table I th 
tion of the time ‘Music’ 
Sel Painting" is . 463. 

of time “Playing Ho 
“Regular Stories” is ‚ 623. 


Rj - Ry = Zik V SE Sk = 2F jkSjSK 
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where Rj and Ry = mean psychological val ues 
characteristically attached to stimuli Sj 
and 5х respectively, 

Тік = Standard measure distance ог deviate from 
the mean of a unit normal distribution, 

5) and sk = standard deviations of distributions 
Rpj and Вук respectively, and 

Tjk - coefficient of correlationbetween Rhj and 
Rhk- 


The radical term is th е standard deviation of the 


differences Rhj - Rhk is expressedas afunction of 
the terms under the radical. 


In Thurstone's Case Ш, he as Sumes rj; = 0, 
that there is no correlation between responses to 
any pair of stimuli. Under this assumption the 
law of comparative judgment reduces to the form 


Rj - Ку = ZjkV si + Sk 


We find the Zik val 


Burros (1) Proposed some i 
с formulas 
for estimating the relati es for the at 


dard 
deviations: ve values for the stan 


Class A, ) 
с is estimated by the formula 
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TABLE I 
THE SIX KINDERGARTEN ACTIVITIES IN TERMS OF PREFERENCE 


PROPORTION MATRIX FOR 
AS JUDGED BY THE FORTY-ONE PUPILS IN CLASS A* (MATRIX Pa) 
Kindergarten Regular Playing Working Easel Causal 
Activities Stories Music House Puzzles Painting Stories 
Regular Stories . 500 . 561 .561 . 537 . 610 .902 
Music .439 . 500 ‚101 ‚561 2537 ‚132 
Playing House .439 .293 .500 .659 .561 .780 
Working Puzzles 463 439 341 500 . 659 .683 
Easel Painting 390 463 439 341 . 500 . 659 
Causal Stories . 098 268 _. 220 917 .341 500 
2.524 2.768 2.915 3. 208 4.256 


Zpj>k 2.329 А 2 
ncreasing Zpj» k 


*Kindergarten Activities have been arranged in order of i 


TABLE I 


TIES IN TERMS OF PREFERENCE 


PROPORTION MATRIX FOR THE SIX KINDERGARTEN ACTIVI 
B* (MATRIX Pp) 


AS JUDGED BY THE SIXTY-ONE PUPILS IN CLASS 


| 
Playing Working Easel Causal 
Painting Stories 


Regular 


Kindergarten 
| Activities Stories Music House Puzzles 
| - 
Regular Stories . 500 ‚514 ‚623 . 656 .T81 .918 
Music .426 . 500 .459 „459 . 656 705 
| Playing House 2877 ‚541 ‚500 . 607 .525 “693 
| working Puzzles . 344 . 541 .393 . 500 . 627 ` 623 
Easel Painting 213 . 344 ‚415 ‚328 . 500 .443 
Causal Stories .082 .295 ‚311 ‚311 .557 ` 500 
.942 2.195 2.827 2.927 3. 697 3.812 


*Kindergarten Activities have 


Zpj»k ч 
| рееп arranged in order of increasing =Pj>k across the columns. 
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where п is the number of stimuli, and Уу, is 
defined as above. ; 


Substituting the Zjk and Mzjx values in Table 


Ш in the above formulas we can estimate the 
standard deviations for the six kindergarten activ- 
ities as judged by the forty-one children in Class 
A. Table IV gives the statistics needed for com- 
puting Burro's formula for the Class A group. 

The sum of the reciprocals of the variances is 
55. 379 and the number of stimuli is SIX, giving а 
c-value of .1083. As 4 check on the computations 
the sj values should sum to n, the number of stim- 
uli. This is due to the fact that this method as- 
sures a mean Sj of 1.000. 

Table V contains the (Rj - Ry) values which 
are computed ассо rding to the assum tions of 
Thurstone’s Case Ш, Rj * F^ ЭХ 81 + Sk and 


the Rj scale values are based upon these assump- 
tions. The Е; value is а method of arbitrarily 
assigning the Z ero point to the lowest mean and 
adding the amount necessary to make the lowest 
mean . 000 to eachof the mean values. Either the 
mean values Or the Ва values can be taken a5 
scale values for the different kindergarten activ- 
ities. Guilford (2) presents а method of locating 
a psychological meaningful zero point; however, 
this requires addtional data. 


The sum column in Table V is used as a means 


the table, the sum 


of the sums should equal zero, the sum of the col- 
umns means shoul d also equal or approximate 
zero, the sum of the Rj’s should equal n times the 
number added to each mean Mz jk: or .353n, where 
n is the number of stimuli. 

Given the scale values in Table V, the next 
problem 15 to determine how well the obser v ed 
proportion of pupils' judgment agree with the ex- 
pected proportion as determined by the above scale 
values. This test of internal consistency is made 
by utilizing Mosteller's m ethod (8) by which a 
single chi-sqare test is made for the obtai n e d 
and expected proportions. The first step is to find 
the expected 2! к values. These are obtained by 


applying the following formula: 
Rj - Rk 


2 
22 
sj + i" 
where the Rj- Ry values inthe numerator are ob- 


tained from the interpair scale differences given 
in the Rj row in Table V The denominator is the 


square root of the sum of the paired 5175 in the 
last row of Table IV. 


These Z'ik values аге giv- 

en in Table VI. Only half the matrix 15 neede 
e identical in absolute val- 

onal cells have been ignored as 
there are по actu y obtained values. 
The expected proportions р', given in Table 
уп, are the areas under the normal curve for the 


2/ ik = 


ues. The diag 


respective 2' jk values in Table VI. Н 2! should 
be negative the area under the smaller portion of 
the curve is used, whereas when 2' is positive the 
area under the larger portion is used. 

The test for internal consistency as proposed 
by Mosteller (8) does not use the actual p and р' 
values since many of the proportions might be 
close to 1.00, whichis the region where sampling 
distributions of p roportions are not normal even 
for reasonably large samples. In order to com- 
pensate for this lack of normality, each propor- 
tion is transformed to а statistic 0, whose заш- 
pling distribution is norm al. Theta being the 
angle whose sine is the Ур, or 9 =arcsin ур. Та- 
ples УШ and ІХ give the Q-values for the obtained 
and experimental proportions respectively. The 
chi-square form applicable for this test of intern- 


al consistency 15 


N 
chi-S are 13:48 
i-squar 321 (8 - 6") 


where N is the number of judgments per stimulus 
air, 9 is the arcsin Y p and 6" is the arcsin Ур”. 
Mosteller (8) gives the degree of freedom for the 
above chi-square as 


dis (n - 1)(n - 4) 


where n is the number of stimuli. Thesum of the 
squared differences between the observed and ex- 
pected theta is 259.6927, and the number of pupils 
is forty-one; therefore 


Chi-square = 410259. 6921) 


ағ = 5 


Chi-square = 12.969 


Since the chi-square is not significant at the 

one percent level one could assume internal con- 

sistency between the observed and expected pro- 

portions for the preference of the six kindergar - 

ten activities as judged by the forty-one children 
in Class A. The scale-values for these activities 
are the Rj values given in Table V, namely: Reg- 
ular Stories .000, Music .017, Playing House 
.259, Working Puzzles .266, Easel Painting . 524 

"та Causal Stories .1.053. The zero point is ar- 
pitrarily assigned. 

Analysis of the p reference of the six kinder- 
garten activities for Class B, using Thurstone’s 
Case Ш assumptions yielded slightly different re- 
sults as obtained for Class А. Table П gives the 
proportion of time each activity was preferred 
over its respective pair. The transformation of 


these proportions into R; - Ry val 
j y values where R; - 
Ry = Zik" 81 + St and the resultant scale - values 


are presented in Table X. Testing of the obtained 
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TABLE VI 


BETWEEN THE SIX KINDERGARTEN AC TIVITIES DERIVED 


EXPECTED SC ALE SEPARATIONS z' 
D BY THE ASSUMPTIONS OF CASE III (MATRIX 2 A) 


FROM SCALE VALUES OBTAINE 


Kindergarten Playing Working Easel Causal 
Activities Music House Puzzles Painting Stories 
Regular Activities 2011 . 308 ‚214 „805 1.161 
Music . 160 „141 . 238 . 671 
Playing House . 006 . 152 . 844 
. 131 . 599 
. 299 


Working Puzzles 
Easel Painting 


TABLE УП 


EXPECTED PROPORTIONS р' DERIVED THROUGH г" FROM SCALE VALUES FOR SIX KINDER- 
GARTEN ACTIVITIES, AS JUDGED BY PUPILS IN CLASS A, UNDER THE ASSUMP TION 
OF CASE Ш (MATRIX Р' A) 


Working Easel Causal 
Stories 


Playing 


Kindergarten 
Activities Music House Puzzles Painting 
Regular Stories ‚504 . 621 . 585 . 620 . 878 
Music . 564 . 556 А 594 А 749 
Playing House . 502 . 560 . 801 
„552 „195 
. 618 


Working Puzzles 
Easel Painting 
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TABLE УШ 


OBTAINED PROPORTION S FOR THE SI 


X KINDERGARTEN AC TIVITIES CONVERTED INTO ANGLES 
IN TERMS OF DEGREES (6) Ву MEANS OF THE FUNCTION 9 - ARCSIN Ур ( 


MATRIX 9) 
Kindergarten Playing Working Easel Causal 
Activities Music House 


Puzzles 


Painting Stories 
Regular Stories 48. 50 48. 50 47.12 51.35 71.76 
Music 57. 23 48.50 41. 12 98.82 
Playing House 54.27 48. 50 62. 03 
Working Puzzles 54. 27 95. 73 
Easel Painting 


04. 27 


8 values for Class A. 


TABLE Ix 


EXPECTED PROPORTIONS FOR THE SIX KINDERGARTEN ACTI 
HTH OF nas BS NDR TON АСЫН E SD INTO ANGLES 


Kindergarten Playing Wasi. 
don i Orking Easel Causal 

Activities Music Puzzles Раш ting брзе ев 
Regular Stories 45. 23 92.00 49. 89 51, 94 69. 56 
Music 48. 68 В о А 
Playing House e 50. 42 59. 93 
Working Puzzleg > 48. 45 
Easel Painting 


0" values for Class А. 


Фолна 
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and expected proportions by Mosteller’s method 
showed internal consistency of the data at the one 
percent level, the level chosen to reject the hy- 
pothesis. The scale-values computed for the six 
kindergarten activities as derived by means of 
paired comparisons and judged by the s ixty -one 
pupils inClass B were Regular Stories .000, Play- 
ing House .333, Mus іс .389, Working Puzzles 
.481, Causal Stories „860, and Easel Painting 
.931. Two pairs of activities reversed position 
when compared with scale-values for the kinder- 
сагїеп activities as judged by the children in Class 
A, namely, Music and Playing House, and Easel 
Painting and the Causal Stories. 

As another part of the analysis of the investi- 
gation, the data for Class A and Class B were 
pooled and scale-v alues for the combined group 
were calculated. Table XI gives the obtained pro- 
portions for the combined group. 

Thurstone's Case Ш assumptions were applied 
to the proportions obtained by pooling the data. 
The Rj - Rk values under these assumptions are 
presented in Table XH plus the scale-values for 
each of the kindergarten activities. 

The chi-square test for internal с onsistency 
was not significant at the one percent level. The 
scale-values thus obtained by Thurstone's Case 
IIl assumptions for the comb 
Regular Stories . 000, Music .188, P 
.305, Working Puzzles .366, Easel Painting . 816, 
and Causal Stories . 894. The zero point is an 
arbitrarily assigned value. 


Conclusions and Summary 


The purpose of this experiment was to find out 
how wellki ndergarten children liked to listen to 
the causally oriented stories specially written for 
their age. Subjects were forty-one kindergarten 
children in Class Aand sixty-one in Class B. 

tions were used. Af- 


his activity with five other kinder- 


garten activities b 
the results showed that 
causal stories was preferred by Class A and the 
combined group over the other five kindergarten 
activities, namely, having Music, ' 
zles, Painting at the Easel, playing House, ог 
Having Regular Kindergarten Stories read to 
them. Тһе si g nificance of internal consistency 
was tested by using the one percentlevel of гејес- 
tion. A similar analysis of the paired сотраг!- 
son data for Class Bshowed a reversal in the 
preference of Causal Stories and Easel Painting. 
Easel Painting rated first and causal narratives, 
second. : | 

Thus, using the materials designed to help the 
child develop 4 causal orientation toward his 50- 
cial environment ranked well with other activities 
included in this investigation. 
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Table ХШ isa summary of the scale values 


assigned to the six kindergarten activities as 
judged by pupils in Class A, Class B, and the 
Combined Classes A and B, using Thurstone’s 
Case Ш assumptions for scaling the paired com- 
parison data. Г 
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11. 


12. 


13. 


. Burros, В. Н. 


‚ Guilford, J. Р. 


. Levitt, Е.Е., andOjemann, В. Н. 


‚ Mosteller, Е. 


. Mosteller, Е. 


. Mosteller, F. 
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ANALYSIS OF MMPI DATA FOR CLASSIFICA- 
TION PURPOSES BY MULTIVARIATE 
STATISTICAL TECHNIQUES’ 


PETER P. REMPEL 
State University of Iowa 


е 


„ THE PURPOSE of this paper is to present the 
steps involved in the appli cation of multivariate 
analysis tothe assignment—with aminimum error 
of classification—of N people to q criterion groups 
on the basis of all information provided by k in- 
tercorrelated measures. 

The study itself stems from the writer’s at- 


tempt to identify 45 potential delinquents or поп- 
delinquents (а = 2), apopul ation of ninth-grade 
high-school boys (N= 1802) to whom the MMPI had 


been administered by Hathaway (3) in 1947-1948, 
and for whom police and/or court records were 
available through 1952. Severity of misdemeanor 
made possible a classification of delinquency into 
eight levels. Because of inte r-group similarity 
of MMPI profiles between nondelinquents and de- 
linquents at levels one and two, the latter (N = 363) 
have been omitted from this study (see Table I). 

Variables used in one part of the study, re- 
ferred to as the main study, consisted of the 
scores on the F, Hs, Sc, Mf, and Pd scales of the 
MMPI (k = 5); in another part, the school study, 
they consisted of the Ma, Sc, Mf, and Pd scales 
of the MMPI; honor point ratio, and numbers of 
days absent in Grade УШ (k = 6). 

In the main study the MMPI profiles of 351 boys 
who had achieved delinquency status by 1952 were 
divided by г andom procedures into two groups: 


sample A (N = 175) which was used i 
ment of the classification formula and Sample B 
ross- validation pur- 


(N = 176), set aside for С 
poses. Two sam ples 0 
similarly selected from 

school study poth delin- 


MMPI profiles. Inthe | 
quent and nondelinquent 6 Г oups were divided in- 
to Samples А" and В' andfour sub-groups (N = 91 


were thus similarly obtained. 


Tests of Assumption 
5 obtained for the statistics Bi 


Numerical value: 
re of skewness, 


and g» —the former used as a measu 


*Submitted to the Graduate School of the Unive 
ments for the ph. D. degree. 


the latter as a measure of kurtosis—indicated that 
roughly the distributions of only half of the vari- 
ables tested in either study (in Samples A and A' ) 


satisfied the assumption of normality. Somewhat 


more satisfying results were obtained from tests 


of homogeneity of variances and covariances. Thus 
approximately 30 percent of the tests indicated 
lack of homogeneity of variances at à statistically 
1, and 20 percent indicated lack of 


significant leve 
homogeneity of covariances. The effect of non- 


normality upon the distribution of D^ and the LDF 
statistic is not known. Atest of the assumption of 
linearity of regression (4:240-245) was omitted be- 
cause the number of variables involved made the 
amount of labor required too excessive. Inspec- 
tion of correlation charts of selected pairs of var- 
iables made it appear unlikely that anything other 
than linearity existed. Because cross -validation 
has been included, the application of multivariate 
analysis was thoughtto be justified even though not 
all of the assumptions have been fully met. But in 
drawing conclusions from this study, one must 
recognize the fact that not all of the assumptions 


for the application of multivariate procedures were 


fulfilled. 


Discussion of Statistical Techniques 


The generalized distance function contr ibutes 
both to the problem of testing significance and of 
estimation. The five (or six) variables usedin the 
main and school studies respectively, may be con- 
sidered as defining а space of five (or six) dimen- 
sions so that any individual with assigned scores 
can be represented by a pointin such a space. The 
individuals belonging to any group will then be rep- 
resented by a cluster of points round another point 
which represents the mean values of these scores 
If the mean values of the two groups are close to- 
gether, then the clusters of points correspondin 
to them will overlap to some extent. Rao (6) А 
shown that the degree of overlap between any two 


rsity of Minnesota in partial fulfillment of the re quire- 
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TABLE I 


EVEL OF DELIN- 
ON OF VALID MMPI PROFILES BY DELINQUENTS, L 
рш QUENCY, AND NON-DELINQUENTS 


Percent Percent 

(Based on (Based on 

Classification Number N = 1802) N = 714) 
Levels 1 and 2 363 20.15 50. 84 
Level 3 231 12. 82 32. 35 
Level 4 47 2.61 6. 58 
Level 5 31 1:42 4. 34 
Level 6 22 1.22 3. 08 
Level 7 12 0. 67 1. 68 
Level 8 8 0. 44 1,12 
Total Level 3-8 351 19. 47 49.16 

Non-delinquents 1088 60. 37 


Total (All delinquents 
and non-delinquents) 


A SUMMARY COMPARISON OF с 
WITH CHANCE EXPEC TATION 


ND 
GROUPS WITHIN THE SAMPLE 
Correct Classi- Correct Classi- 


Si Percentage 
шоп by fication by Improvement 
— s alg Chance Over Chance 
Š % 
N % 
Sample A - Delinquents М = 175 120 68 
Sample А - Non-delinquents N = 172 15 65.7 Bre 50.0 Ч үн 
Total Sample A N = 350 235 671 175.0 а M. je 
Sample B - Delinquents N= 176 124 
Sample B - Non-delinquents N = 175 103 53 сэн 50. 0 20. 5** 
Total Sample в N = 351 221 647 Mia Шо 1472 
All Groups Total N = 701 462 65.9 350.5 50.0 18. нэ 
* Significant at the 5 percent level 


**Significant at the 1 Percent leve] 


-= 
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such clusters can be measured by the quantity 


i= j=5(6) ;; 
pee 599 5 © aa 
1-1 іі 

where 9142;...› 45 (ог 46) аге the differences 
between the mean values of the measurements in 
the two groups, and ай are the elements of the 
matrix reciprocal to the within group dispersion 
matrix (ај). The higher the value of D?, the 
smaller the amount of overlap, 50 that this meas- 
ure may be regarded as the ‘‘distance’’ between 
the groups. 

The calculation of D? is given in ће basic 


as 


form 


р? = 242; 510944} (1) 
atrix inverse to Sij; the 


sij- elements of the m 
s уапапсе-со- 


dispersion ОГ within group 
variance matrix. 


mean of the ith 


where Жұ represents the sample 
n of a second 


variable and Xj is the sample mea 


category of the ith variable. 
The ““рев!” (unbiased) estimate of the sample 


variance used in the matrix for the ith test is 
Nx Ny 2 
ов- 12 Жи = х? + DJ Yik - NY 2 
i T> ік - Мх +, ік i (2) 


and the unbiased estimate of the covariance be- 
comes 


fined by the ith test (or 


the random variable de- 

fined by the ith test (or 

scale) in the second group, 

Х: = the score of the kth indi- 
и vidual in the first group 

on the ith test. 

Y; the score of the kth indi- 
x vidual in the second group 


Үү(-1,2,--29) = 


Ш 


on the ith test, 


Xi = the sample meanof the ith 
N test in the firstgroup, and 
Yi = the sample mean of the ith 


test in the second group. 


Rao (9) metthe problem of eliminating the cor- 
relation between variables by suggesting а method 
of transforming the original set of data to an ог- 
thogonal set of variates. Themost efficient meth- 
od of accomplishing this purpose is the application 
of pivotal condensation to the calculation of D?, 
and is the method used in this study. A system- 
atic and straight-forward illustration of the use of 
Rao's pivotal condensation technique is found in a 
paper by Collier (2). 

The appli cation of pivotal condensation to the 
problem of transforming correlated variates to a 
new orthogonal set reduces the covariance ele- 
ments in the 51) variance-covariance matrix to 
zero, thus eliminating the element of correlation 
from matrix calculations. The general distance 
function may then be expressed as 


ужар 
2 1 
р = Ec 4 (4) 


in which *511, the variance of the transformed ith 


variable is found easily in the process of pivotal 
condensation, and each #912 гер resents the соп- 


*Sii 
tribution of the ith variable to the total О2. 


Significance of Added Variables 

Rao (7,8,9) has con sidered the following ap- 
proximate test of significance which can be used 
for testing both the significance of the total D? val- 


ue as well as of the addition to р? of each variable 


separately: 
а (#-р-а+1) njn2(D^ р+а- D^ p) (5 
а f(ny+ng)+nin2D%p ) 


where f = the degrees of freedom of the disper- 
sion matrix of var iance and covariances between 
the ith and jth characters 


number of variables in 2 


а = number of variables added to form Оба 
= size of sample or groups from the first 

population considered 

по = size of sample from the second population 


T has the F-distribution with а and {-р-а+1 de- 
grees of freedom. Use of this value enables the 
research worker to determine the effect of adding 
variables to the matrix. 
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The Linear Discriminant Function (LDF) 
mha Linear ^er Roo 


ї 1 һегеі-1 

t Хүүс be a typical observation \ : 

2 stes K the sample, j for the trait measured , 

and k for the individual. Then the value Y, would 
ђе determined by the function 


Yik=>1X1 + хоХо +... + ^pXp (6) 
and 
TEES S q 


where Zi denotes summation over the ith sample 
(i= 1,2). The problem is to maximize the ratio 


* Үз - 5 


уа ак) + ZoYok- Yo)" (8) 


which may be written as , 


(9) 


ван 1 я 
where dj = по Хор - m 1Xijk for the jth quan- 


tity (j = 1,2,..., p): and Sim is the pooled sum of 
products of deviations from the sample traits j 
and m. Maximization of the above ratio requires 
essentially setting the derivative of the ratio equal 
to zero, and we obtain 


5 5 8 
хх i 

tee Б 

m=1 j=1 а 
A 240) 


(10) 


р 
* А Хр (r = 1,2,...,р) 


Since the first factor in the above equation will be 
the same for all equations it can be ignored. 
Therefore, the coefficients required are obtained 
through the solutions of the normal equations 


S111 + 5127. ..+ Sipp 


= 4, 
Spi + 80252... + Бррхр = др 


СРИ ТИ И 


(11) 


The diagonal cells in the above matrix represent 
variances, while all other cells consist of th 
propriate covariances. D? 
expression " 


l e ap- 
is represented by the 
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p 
D. 50% (12) 
в, ка 
and the discriminant function by 
р 
: LDF < 2 Xf (13) 
% » i-1 iXi 


* 


Classification Procedures 


The application of Fisher's discriminant func- 
tion to the problem of classification into one ог 
two known multivariate normal populations is dis- 
cussed by Anderson (1). Suppose that an individ- 
ual I with а set of measurements Хі,..., Ха has 
been randomly drawn from one of two populations 
Ш and H2, with probability distributions of the 
measurements 141(Хі,...,Ха), respectively. 
Suppose further that X1,..., Ха Ваза joint normal 
distribution with means іп Ш of ЕХ; = uj (1), and 
in По of EXj = uj (2). Let the common set of var- 
iances and covariances be 


2 2 
01)... бр; 012» 013) ++ “Ор-1, р. 


It can then be shown that the best regions of clas- 
sification are of the general form 


Kiosk Ее 
Ry қазағы Xi 2 m + )«logk 


(14) 
| р 1 2 
R2 EA < E: ta aug b atom (15) 
P 
where X1,... › Хр is the solution of 
p 
И. ВИ. У 
Е ооу щщ) « (8) 
and log k = log L(1 | 2)ро 
211 P1 
wh i 1 i 
е 1) is defined as the loss incurred in 


individual 1 into Ш i = 

1 2 when in fact һе be 
diront МЕЛІ 12) as the loss incurred in 
belong to По. idual I into Цу when in fact he 
ME eee 9f Formulas 14 and 15 is the 
Formula 13) a Which the LDF simplifies (see 
variance m if the population expectations and со- 
atrix are known. Therefore if k= 1, 


i.e., if the probabili у i 
is equal осу of belonging to one group 


ng from the other, then log k = 0, 

ын пе procedure is reduced to a comparison of 
a individual observations with the LDF 
of the averages of the respective means. If a pri- 
ori probabilities are not Specified. loc t 


ie Fanna 


р 
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so that the risk when I is from Ші is equal to the 
risk when I is from П2. For this purpose let 


Зээ -ppa (u +0) = Оо-о) 07 
1 


Since a linear function of normally distributed 


.. variables is again normally distributed, the dis- 
7% ~ tribution of U is likewise normal, with the mean 
and variances found as follows: Е 
| а а iy "(2 
nu) = Йэ ED ue ru) ав 
» LI 1 
: 1 (2) 
and oy = DXi (uj ). ur y (19) 
Let ? за .. (20) 
90, ы 
(1) 
Then if Iis from Пу, E(Xi) = Ш and 
а 1 2 
во 399 030-0) t 00 


where a is the ‘‘distance’’ between Пү and По or 
the variance of О. If Iis from По, E(Xj) = uj! ), 
and 


(2) _ 2 


q 1 
E(U) = Yu ш" ) - uj = -фа (22) 
The probability of misclassifying ап individ- 
ual belonging to II1 is then determined by refer- 
ring the following relation toanormal distribution 
table: 


1 к-4 
46: аавын) TA 


The probability of misclassifying an individual 
belonging to Па is determined by referring the 
' following relation to a normal distribution table: 


| 46 > век) (24) 
а 


a 
In both cases k = По I. ЕШ = I then k = 1, 
and log k = 0. Then the formula for calculating 
the theoretical probability of misclassification is 


n reduced to 
22 о-(-та) 
Р -——— 
( Ёо» Ta 
- The overall probability of misclassification is 
then the sum of products of the population propor- 


tions in each group and the respective misclassi- 
fication probabilities. 


(25) 


* 


* 
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Test of Effectiveness of Linear 
Discriminant Functions A 


Lubin (5) has given us the tests of the eff ec- 
tiveness of the LDF inclassifying individuals cor- 
rectly. Ina two-contingency table let the columns 
Ај and Ag represent the observed categories while 
the rows B1 and B2 represent the predicted cate- 
gories. If perfect discrimination is achieved, only ` 
the diagonal ceils will be filled. For pure chance 


discrimination 
Aj Bj 
ni = =L (26) 
N 
» 
where 
nj = frequency of the ith row and the jth 
column ? 
А marginal total of the ith row 


Bj = marginal total of the jth row 
= total number of individuals involved. 


In case the value of chi-square computed on the 
diagonal values is found to be significant (and a 
chi-square test is a necessary but not sufficient 
test to demonstrate classification beyond chance), 
then the one-tailed t-test for large values of N, 
may be used to demonstrate the effectiveness of 
the LDF: 


(о - УМ 


t= Г - е) (27) 


where о denotes the number of individuals classi- 
fied correctly, and e the number of correct clas- 
sifications expected by chance. 


Computational Procedures 


In the main study the variance-covariance ma- 
trix representing the equation 


5 (а) (2) 


а CU 


was found to be as shown at top of nex 
This set of simultaneous Їїнээс на + АЈ Ї 
Sample A was solved by the method of pivotal A 
densation. This method reduces the ori inal ав 
of variables to anew orthogonal set of m = 
lated variables, and makes possible a test P xm 
nificance of the contribution of each variable t E 
The numerical values of the beta weights d - 
individual contribution of variables to D? а es 
еса in the middle of the next page 
e value of D? computed from t 

these weights and the difference in eren ie ^ 
variable was found to be .9173 (i. е., the етих 


аге аз 
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Е Hs 5с м Ра цб. цд 
Р 11.9238 >] + 1.9755 x2 + 17.1353 хз + 3.5074 4+ 8.4253 х5 = 1.5029 
Hs 1.9755 ҳу + 77.1411 Хо + 38.6356 ^3 + 19. 5887 Ха + 30.8370 ^5 = 2.2743 
Sc 17.1353 х1 + 38.6256 Х2 + 104. 1166 ^3 + 28. 5664 ^4 + 50.1619 ^57 4.9714 


Pd 8. 4253 ху + 30.8370 x 


3. 5074 х1 + 19. 5887 Ха + 28. 5664 ^g + 72. 7225 ^4 + 16.9858 ^5 = -3.5029 


2+ 50.1619 ^g + 16. 9858 ^а + 99. 6932 ХБ = 


6. 6629 


Beta 
Scale Weights 
F = .0695 
Hs ^2= .0116 
Sc X3 = .0256 
Mf ^4 = -. 0782 
Pd Х5- .0578 


was obtained by adding the contributions of vari- 
ables to D? above). 


The first part of the expression 


р р 
0525 Xi З ЫН (uD, uj (2) 
i= 


іі! ) 
is obviously the LDF, and 01 scores in 


Sample A, its n um erical value Will be Written in 
terms of the beta weights for рг ав follows: 


for any set 


р 
LDF =È Xi Xj = . 0695%Х 1 + + 0116X5 + + 0256Х 4 
јЕ 


а constant quantity, с, 
© is the sum of the pr 5 
and the sum of means, or 


€ = 2 | (.0695)(12. 309) + (.0116)(103. 4514) , 


Е 0256)(119. 0628) + (-. 0782) (103. 6629) + 


(- 0578) (121. 6914)] = 2. 0169 


Contribution of 
Variables to D? 


.1894жж 
.0534ж 
.0556* 
. 3750** 
+ 2439%% 


D? = 9173 


and 


the delinquent sa 


turns out to Бета; for the non- 


delinquent sample 
the mean ig - 1 т 


Ву re-working Formulas 23 and 24 above, 
through the use of Formula 25, the probability of 


nisclassifying an individual from either рорша- 
tion is found to be . 3161, Thus 


» Or a= 0,9173, and af = 0.9578. For 
mple therefore, the mean of U 


= ће numerical value of these 
found to be а = . 4586 and -а = 
-.4586, 
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TABLE IV 


UMM. FICATION EF 
Y COMPARISON OF CLASSI 

d WITH CHANCE EXPECTATIONS FOR SAMPL 

DELINQUENT AND NON-DELINQUE 


FICIENCY FOR THE LDF ,.. 
ES А" AND B' IN THE SCHOOL STUDY B 
NT GROUPS WITHIN THE SAMPLE 


Correct Classi- 


Correct Classi- Е 
fication b fication by Improvemen 
Forma” Chance Over Chance 
Samples Used N % N LN 
i ۴ 0 15. 9* 
le A' - Delinquents N= 91 60 65.9 45.5 50. 
ae A' - Non-delinquents N= 91 12 19.1 45.5 50. 0 29. эн 
Total Sample А" N = 182 132 12.5 91.0 50.0 22. 
Sample В! - Delinquents N= 91 63 69.2 45.5 50.0 19. он 
Sample В' - Non-delinquents N= 91 63 69.2 45.5 50.0 19.2 
Total Sample В' N = 182 126 69.2 91.0 50.0 19, 2* 
All Groups Total N = 364 258 70.9 182.0 50.0 20. 9* 
*Significant at the 1 Percent leve] 
TABLE У 
CLASSIFICATION OF DELINQUENTS BY LEVEL OF DELINQUENCY USING LDF PROCEDURES FOR 
туо SAMPLES OF THE SCHOOL STUDY: SAMPLE A N WHICH PR 
WERE DEVELOPED AND SAMPLE В' ON 


EDIC TION FORMULAE 
WHICH THE FORMULAE 


WERE APPLIED 
THROUGH CROSS-V ALIDATION 
Sample А' 


59. 62 69 65. 71 

30 9 60. 00 12 15. 00 21 67. 74 

5 21 9 15. 00 6 66. 67 15 71. 43 

6 13 5 71.43 5 83. 33 10 76. 97 

7 7 1 50.00 3 60. 00 4 57.14 

8 5 1 50.00 3 100. 00 4 80. 00 
Total 182 63 69. 23 60 65. 93 
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Pg = .50 
р2/р1 = 1.00 
0.00 


р 
k! 
k 


loge 
and П] = По = 22900089) ог . 3161 


which is equal to 31. 61 percent. 
This means, in effect, that the theoretical 
probability of correct classification for both de- 


linquents and non-delinquents is 68. 39 percent. 


This overall percentage is essentially the same as 
that achieved by chance when the ratio of non-de- 
linquents to delinquent in the population is known 
to be 80.5 to 19.5, as it is in this study. How- 
ever, classification of delinquents by the LDF is 
more than three times as effective as chance pro- 
cedures, the formula identifying 68.4 percent of 
the delinquents as against 19. 5 percent identified 
by chance. 

In actual practice, however, the population 
distribution is not known. In such a case, pure 
chance procedures make possible the identifica- 
tion of a maximum of 50.00 percent of each of the 
two populations. 


Empirical Cross- Validation of 
Samples A and B 


The actual effectiveness of thelinear discrim- 
inant function was tested empirically by applying 
the formulas developed on Sample A to the delin- 
quent and non-delinquent sub-samples in both 
Samples A and B. A study of cross-validation re- 
sults as presented in Table II indicates that the 
overall percentage of correct classifications, 
namely 65.9, fell somewhat short of the predict- 
ed percentage of 68.4. The fact that the formula 
correctly identified only 58.9 percent of the non- 
delinquents in Sample B accounts for most of the 
shrinkage. However, 68.6 percent of delinquents 
in Sample A, and 70. 5 percent in Sample B were 
correctly identified. 

Classification res ults were also broken down 
in terms of delinquency level, as presented in 
Table III. The tabulation shows generally in- 
creasing proportions of correct classification with 
level of delinquency, level 5 being the only excep- 
tion. 


Results of the School Study 


The set of linear equations composing the var- 
іапсе-соуагіапсе matrix for the six variables 
used іп the school study is presented at the top of 
the next page. 

The following is a list of the essential statis- 
tics derived: 


LDF = .0219Х1 + .0282Х2 - .0430X3 + 
.0398Х4 - .5247Х5 + .0564Х6 
с = 2. 1955 


а = 1.2387; Va = 1.1129; +а = . 6194 


Using Formula 25 the probability of misclassi- 
fying an individual from either population is found 
to be equal to 


ЭРГЭЖ 0.0 - (-. 6194), _ 
ш = I2 =р(2> У) = 


р (Z >. 5566) = . 2906 


or approximately 29.1 percent, or a probability 
of correct classification for both delinquents an d 
non-delinquents of 70. 9 percent. 

Results of cross-validation procedures (see 
Table IV) indicate that classification by the LDF 
permitted the correct i de ntification of 65. 9 per- 
cent of the delinquents in Sample A' and of 69.2 
percent in Sample B'. Corresponding figures for 
non-delinquents are 79.1 percent and 69.2 percent 
for Samples А" and В" respectively. In each case 
improvement over chance is significant at the one 
percent level. 

Although the two studies are not strictly com- 
parable, the additionof school record data raised 
the overall predictive efficiency of the LDF from 
65.9 percent in the main study to 70.9 percentin 
the school study, a difference significant at the 5 
percent level. 

Classification results іп terms of the level of 
delinquency are presented by means of Table V. 
Again, an increase inthe proportion of correct 
classifications corresponds closely with increase 
in level of delinquency, a marked drop in predic- 
tive efficiency at level 7 being the only exception 
in the general trend. 


Summary 


This study has demonstrated the usefulness of 
applying multivariate statistical techniques to th 
analysis of MMPI scale scores alone, or in © 2 
bination with school data, for the purpose of эв 
sifying ninth-grade boys ав potential delin » ie 
or non-delinquents. The techni ques anit ыы 
proved to be effective to the extent that 62 io Si 
cent of the non-delinquents and 69. 5 percent S the 
delinquents were correctly identified by the ches 
Multiphasic data alone, while acombination ҮМ, | 
tiphasic and school record data made 05 ib “= 
correct identification of 74.2 percent d ШО, m 

-delin- 


quentboys and 67,5 с 
boys. Percent of delinquent-prone 
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ON PROGRAMMING THE INTEGRAL RTH 
ROOT OF A NUMBER ON FRIDEN STANDARD 
AND SQUARE ROOT DESK CALCULATORS 
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San Diego, California 


1. Introduction 


IT IS THE purpose of this note to indicate some 
formulas which are useful for finding the roots of 
real numbers on Friden Calculators. The meth- 
od used to derive the formulas is due to Newton 
although its application for taking advantage of the 
Square-root feature оп certain Friden Calculators 
is believed to be new. The mathematics of the 
derivations are given inthe third section of the 
paper since this information is probably of least 
interest to those who will operate with the form- 
шав. Тһе second section contains three examples. 
The cube root example is symbolically pro- 
grammed for the calculator in section 4. Ta- 
ble I presents a set of formulas which are effi- 
cient for finding the rth root of a real number. 
That is, for finding the square root, cube root, 
etc., of some number, N. The symbol x in the 
formulas stands for a guessed or trial value of the 
root and conc eivably may be grossly in error. 
However, ifitis considerably in error, the form- 
ulas will ultimately provide an accurate estimate 
of the root although it will take a longer time than 
if an estimate only slightly in error were used. 

The method to use with the formulas involves 
essentially selectinga trial value, substituting 
this (as x) in the appropriate f o r mula and there- 
by obtaining a new trial value. This new value is 
again substituted intheformula (as x). This iter- 
ation process is repeated until the desired deci- 
mal accuracy is obtained. 

The formulas in Table I have been arranged so 
that the arithmetic ultimately involves the accu- 
mulation of two quotients. Therefore, the proper 
machine set-up requires the NON ENT key to be 
down. Alsoitis advisable that the ADD key is in 
the down position. In some of the formulas the 
divisors of x appear as improper fractions (e. g., 
178 In actual computations, these numbers can 


*The opinions expressed are solely those of the author a 
Strued as representing those of the U.S. Naval Perso. 


Personnel. 


be changed to decimals appropriate to the accu- 
racy required or the problem can be treated as a 
simple quotient (e. g., 3x/4). 

The Square Root Friden enables all roots of the 
form r- 21, where i - 1,2,3,..., of a number to 
be taken directly without the use of any special 
formulas. For instance, to obtain the fourth root 
of 2. 0000 we merely findthe VZ = 1. 41421 and then 
find / 1.41421 = 1.1892 which is an accurate esti- 


3 

mate of 24, The higher order roots are difficult 
to obtain on the standard Friden since the proced- 
ure requires the trial values to be raisedto large 
powers. The Square Root Friden is gener ally 
more efficient for both the high and low order 
roots since the quantities involved in the quotients 
are easier to calculate and moreover the Square- 
root formulas provide estimates which converge 
faster to the root desired. 


2. Examples 


a) Consider first the problem of taking the cube 
root of 27. The answer, of course, is 3, but 
nevertheless let us see how the method works on 
a problem for which we know the exact answer 


Suppose for our example that we wish Our answer 


to be accurate to four decimal places. ү, 

у . е 
decimal on the keyboard, lower dial ү x 
dial to one more than this, 5, у 4 


псе we will 
oruse the Square Root 


i nner 
ing a cube root, the prope ) and we are seek - 


ын г formula from Table 1 
N x 
Зх® 1.5 


We first guess atrial value of x = 2,0000 ва, 
. , у, 


nd are in по way 


official; 
mnel Research Fi ; hor are they tobe con- 


eld Activity or Bureau of Naval 
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TABLE I 


FORMULAS FOR FINDING ROOTS OF NUMBERS BY ITERATION 


М 
Be San Square Root 
2 (Square) N Boa а я 

2x 2 


3 (Cube) 

4 (Fourth) 
5 (Fifth) 

6 (Sixth) 

7 (Seventh) 
8 (Eighth) 
9 (Ninth) 


10 (Tenth) 


N = Number 
х = Trial Value of Root 


dard Friden 


ск 
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and copy this value on а worksheet. The number 
x is entered on the keyboard and also in the mul- 
tiplier dial. The MULT key is punched and the 
resulting square is rounded to 5 decimal places 
and recorded on the keyboard. Then 4.00000 is 
multiplied by 3 and the product, 12. 00000 is 
enteredonthekeyboard. The upper dial is cleared 
by hand, Next the number N = 27 is entered with 
the twirler knobs in the upper dial around the 10th 
decimal place and the dividing keys are pressed. 
Now 2.25000 appears in the lower dial which 
should now belocked. The upper dial is clear but 
in general it will notbe, so that the CARR CLEAR 
will have to be punched. We have now found the 
first of the two quotients. Next x = 2.00000 is en- 
tered around the 10th decimal place in the upper 
dial and 1.50000 (the constant divisor) is entered 
on the keyboard. After dividing, we find 3. 58333 
in the lower dial. This number is our second ap- 
proximation. After copying this on our worksheet 
we clear the upper dial and lower dial, and unlock 
the lower dial, and proceed to square 3. 58333 
and start repeating the evaluation of the formula. 
After six trials the worksheet would appear as: 


Trial x 
1) 2.00000 
2) 3.58333 
3) 3.08980 
4) 3.00257 
5) 2.99999 
6) 2.99999 


Because the fifth and sixth trials produce the 
same number, we cease iterating since a stable 
trial value has been determined. Therefore, the 
estimate of the cube root of 27 is 3.0000 which is 
accurate to four decimal places. 

b) The same problem will now be worked using 
rootfeature. The appropriate formu- 


the square- 
la from Table I, in this case, is: 
ҮН Г.Ж 
15x 3 


We first calculate di с 
УЗ = 5. 19615 and Y2 = 1.41421 which is the 
square-root of our first Ne: 

is copied on the key board and multiplied by 1.5. 
This product, 2.12132, is copied on the keyboard. 
After clearing the upper dial by hand, 5. 19615 is 
entered with the twirler knobs in the upper dial 
around the 19th decimal place. Division gives 
2.44948 in the lower dial which should now be 
locked. Clear the upper dial. Then enter х = 
10th place in the upper dial. 
constant divisor) on the key- 
um ulated quotient is 
alue is recorded on the work- 
d trial value. Next we 


calculate УЗ. 11614 and repeat the process. The 


lower dial should be unlocked when starting a new 
iteration. The worksheet would appear as: 


Trial E 
1) 2.00000 VZT = 5.19615 
2) 3.11614 
3) 3. 00108 
4) 3. 00000 
5) 2.99999 


Notice that one less iteration is required for the 
same problem by this method than for the method 
used in sample a). 

c) As a final example consider the problem of 
finding the fifth root of т = 3.14159 - N, using the 
Square Root F riden. From Table I the formula 
to be used is: 


1 
№ х 


— + 
1. 25x* 


a 1 
The first task is to find N* = (3.14159)4. Since 


[3714188 = 1.7724531, then № = VI. T4531 = 
1.33134. This number is recorded on the work- 
sheet. Because the fourth root of 7 is about 1.3, 
a good trial value for the fifth root is x = 1. 20000. 
The fourth root of x is now found and multiplied 
by 1.25. The resulting product, 1.30830, is re- 
corded on the keyboard and 1. 33134 is entered 
around the 10th decimal place in the upper dial. 
Division gives 1.01761 and the lower dial is locked. 
Clear the upper dial, enter 1. 20000 in the upper 
dial around the 10th decimal place, and record 
5.00000 (the co nstant divisor) on the keyboard. 
Division produces 1.25761 in the lower dial and 
this value is the new X. After four trials, the 


worksheet appears as: 


Trial кезш. 2 
i 
1) 1.20000 (3.14159)* = 1.33134 
2) 1.25761 
3) 1.25726 
4) 1.25727 


Therefore, the fifth root of т is 1.2573 which 15 
accurate to four decimal places. 


3. Mathematical Derivations 


Newton’s method for finding roots of i 
is well-known and is ree e in шаг 
many theories of equations and calculus textbooks 
Essentially the method says that the roots of эх 
equation in x are approximated by the following 
expression: 


К 
x - 9 
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where f(x) is the function (or equation) of х, the 
Р (x) is its derivative. The functions of х used in 
this paper are very simple since we seek Only the 
values of x satisfying the equation хі = N, where 
ris some positive integer. Of course f(x) = xT - 
N. We could consider more general functions but 
the result would be to introduce types of problems 
not frequently attempted. The derivative of f(x) 
is f' (x) = rx*-1 so that: 


xX = f(x) = X- (x^ - N) - N 4- x 
f (x) rxr-1 rxr-1 Рур | 
Upon substituting г = 258,022, 10 in the expres- 


Sion, we produce the formulas under the ‘‘Stand- 
ard Friden’’ heading of Table I. 

It occ urred to the author that it may be more 
efficient to utilize the square-root feature on cer- 
tain Friden Calculators to find the root of num- 


bers. Accordingly, f(x) was modified to be: 
1 
f(x) = хї/2 _ ма 


Newton’s expression leads to: 


W x 
rx (Г-2)/2 r/(r-2) 


2 


in this case. Substitution of r = 2 and 3 in the 
above expression gives the formulas for the square 
and cube roots in the second formula column of 
Table I. One could also use Бф буна ВӘ сеп- 
erate iterative formulas for other roots, but it 


1 
appears more wise to use f(x) = хг/4 _ NF for r = 
4,5, and 6. This was done and the expression is: 


1 


N* dà X 
rx (r-4)/4 r/(4-4 
4 
For г = 7,8 and 9, one might consider f(x) = x1/8 
- N1/8, 

The general principle being used is now clear. 
The Square Root Friden formulas are derived 
from Newton's ex ression utilizing a function of 
the form f(x) = хг/К - МИК where k = 2,4,8,16, 
+++, Newton’s formula for the general case 15: 


x1/k 


rx U-E)/k | yee ' 
k 


To find the rth root of N, locate r between two 
Successive k’s where k is an integral power of 2. 
In case г is equall y distant between two k's use 
the smaller k. Substitute k andr in Newton's gen- 
eral formula above and solve for the specific iter- 


ation formula. Particular formulas are givenin 
the third column of Table |, 


4. Programming a Desk Calculator 
----2322НанЫ а Мезк Calculator 


The art of computation by means of desk calcu- 
lators has been taught primarily by the demon- 
stration of technique by the more experienced to 
the less. This unfortunate situation exists because 
it is usually much easier to show a student how to 
use a desk calculator than to tell him how, and ad- 
equate instructional materialsarescarce. Calcu- 
lator manufacturers have not been too successful 
in producing manuals which have been useful to 
novices. Frequently manuals and books on desk 
calculators have attempted to illustrate computa- 
tions for particular problems. Consequently, it 
is often the reader’s unpleasant duty to attempt to 
adapt the principles or steps of the computation 
(if they can be discovered) to the special task at 
hand. | 

Some of the difficulty could be overcome if an 
efficient code were devised for the instructions. 
While the author was engaged in writing statistical 
programs for a digital computer, it occurred to 
him that instructions similar to those used by сош- 
puters could be devised for desk calculators. An 
extensive list of instructions for Friden calcula- 
tors has been defined. Rather than list these in- 
Structions here, suffice it to say that they refer 
to a) the set-up of the re gisters and keys of the 
calculator, b) arithmetic Operations, с) the trans- 
fer of information, d) branching and decision mak- 
ing, and e) tallying and control. These desk cal- 
culator instructions could be givena mnemonic 
code (such as CAD for “Clear and Add’’) in the 
same manner as digital computer instructions, 
but this does not seem particularly necessary and 
would impose an additional learning task. The 
advantage of desk calculator instructions include 
the increased generality of computation illustra- 
tion, the compact description of computations, 
and à permanent and convenient record of proced- 
ures. The author believes that the actual demon- 
stration of technique remains the most efficient 
method of learning desk calculator computation 
but he also believes the programming of often 
used and important procedures bears serious con- 
sideration, particularly for instruction manuals, 
and statistical and computation laboratories. | 

In order to illu strate how the programming 
might appear, aprogram for finding the cube root 
of N is provided in Table Il. The reader is invit- 
ed to follow Simultaneously the verbal description 
in the first example in section 2 of this paper and 
the commands in Table П. Each command is giv- 
еп a number called a Step Number and a Refer- 
ence. The Step Number provides the order in 
which the command is executed. The Reference 
refers to such things as a set of registers, а соп- 
Stant or variable, pair of numbers in registers, 4 
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direction on how to setar egister ог key, ога 
Step number. The Reference is divided essential- 
ly into two parts. The first part contains such 
things as those items just mentioned, and the sec- 
ond part (separated from the first by a colon) al- 
ways Specifies the decimal accuracy or decimal 
position if it is involved in the command, 

The commands have not been Specifically de- 
fined. By reading the “Remarks” co] umn, the 
reader should have little difficulty in determining 
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the proper interpretation. Before the reader con- 
Siders Table П, he should know that the symbols 
used are defined аз: К = Keyboard, M = Multi- 
plier Dial, А = the Upper Dial or Register, @ = 
the Quotient or Lower Dial, d = thedecimal accu- 
racy desired, and x = a trial value of the cube 
root of N. The command “STANDARDIZE”’ means 
that all the dials and registers of the calculator 
are unlocked and cleared, and all the operational 
keys are put in their usual (UP) position. 


JOURNAL OF EXPERIMENTAL EDUCATION 
(Volume 28, Number 3, March 1960) 


MUST ALL TESTS BE MULTI-FACTOR 
BATTERIES? 


EDWIN Е. ROSINSKI 
Medical College of Virginia 
Richmond, Virginia 


MUCH OF the present day literature on meas- 
urement concerns itself with the application and 
use of multi-factor test batteries. This trend has 
become so pronounced that journals are publish- 
ing series of articles on the use of these batteries. 

In one of these series of articles, Super stated 
that if the detailed data on validity and norms in 
the Psychological Corporation's Differential Ap- 
titude Tests and the United States Employment 
Service's General Aptitude Test Battery are as 
adequate as claimed, then: 


....the use of singly developed and validat- 
ed aptitude tests in counseling will in due 
course be virtually a thing of the past: the 
multipotentiality of the multifactor test bat- 
tery gives too many advantages. 1* 


It appears as if Super's statement is not so 
much a prophecy but a recognition of a reality that 
is already present. With a pronouncement such 
ки зои иог озшен the ultimate question arises: 
DS gl re tests be replaced by multi-fac- 

r batteries? It appears, however, that before 
this question can be answered, there is a need 
first to investigate the relationship between these 
single score tests and m ulti - factor batteries to 
ascertain if eachis performing a different function 
or the same function. The Psychological Corpor- 
ation's Differential Aptitude Tests (hereinafter 
referred to as the D. A. T.) and the World Book 
Company's Otis Quick Scoring Mental Ability Test: 
Gamma Form (hereinafter referredtoasthe Otis) 
were used to study this relationship. The deci- 
sion to use these two tests appeared practical for 
two reasons: (1) the D.A. T. as a multi-factor test 
is gaining in prominence and use and includes in 
the manual a great deal of standardization data, 
and (2) the Otis is one of the most commonly used 
tests providing а single score IQ. 


Procedure and Results 


A sample of 153 students of asenior high-school 
classwas selected. From the permanent record 


жАП footnotes will be found at end of article. 


cards ofthese senior students the appropr iate 
D. A. T. scores and the Otis IQ scores were taken. 
The former test had been administeredto the stu- 
dents when they were inthe tenth grade and the 
latter when the students were in the twelfth grade. 
There is no appar ent reason to suspect that this 
recognizably selected sample should in any way 
contaminate the results. Using these raw data 
from the Otis and the D. A. T. scores, means, 
standard deviations and Pearson product-moment 
correlations were obtained. 

After these correlations were found, it was nec- 
essary to determine partial r's and partial stand- 
ard deviations.3 Next the formula for Multiple R 
was used and a multiple correlation of .82 was 
computed. Not only does this prove to be signifi- 
cant at the . 01 level of confidence but it is an ex- 
tremely high correlation. 

The next step involved the determination of the 
multiple regression equation or predicting the first 
variable (Otis) on the basis of the two known vari- 
ables (D. A. T. Verbal and Abstract Reasoning Raw 
Scores). With the calculation of the partial re- 
gression coefficients, values were substituted in 
Cornell’s4 multiple regression or prediction equa- 
tion and the OtisIQ score might then be predicted. 
This equation reads: 


X, = bins Xe + Dis.2 Xs +a 


where X, is the predicted Otis score; Ъ 2, з and 

b; 3.2 the partial regression coefficients .51 and 

.89 respectively; Xa the D. A. T. Verbal Reason- 

ing raw coge е, ыры D. А. T. Abstract Reasoning 
м score; ап а” the cons i 

tion determined to be 10. 5. SUPE ME 
Therefore, with the above equatio 

values, a hypothetical example a be = an 

suming that student A has а D. A. T. Verbal oni 

Abstract Reasoning scores of 29 and 34 respe 

tively, what will his predicted Otis IQ be? шил 
Substituting in the formula we have: | 


Х, 


х Біз.зХа + Баз. Хз + а 


.51(29) + .89(34) + 70.5 
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TABLE II 


ACTUAL AND PREDIC TED OTIS (GAMMA) IQ SCORES WITH 
DIFFERENCES BETWEEN THE TWO 


Actual 
Otis IQ 


Predicted 
Otis IQ 


104 

99 
108 

99 
104 
108 
102 
101 
108 

90 
129 


Difference 


со к gi со to со ت‎ -1 O 


- 


- 
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Student 


42 
43 
44 


Actual 
Otis IQ 


Predicted 
Otis IQ 


108 
104 

99 
112 
104 
123 


Difference 
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TABLE Ш 


NUMBER AND PERCENT OF PREDICTIONS MADE WITHIN CERTAIN 
DIFFERENCES BETWEEN ACTUAL AND PREDICTED 1Q’S 


Diff. Between 
Actual and Pre- No. of 


Percent of Cumulative 
dicted IQ Predictions Predictions Percentage 
0 5 7 Т 

1 10 12 19 
2 19 11 30 
3 11 13 43 
4 8 10 53 
5 8 10 63 
6 5 Ч 70 
m 8 10 80 
8 2 2 82 
9 5 7 89 
10 1 1 90 
11 1 1 91 
12 0 0 91 
13 3 3 94 
14 2 2 96 
15 1 1 9" 
16 1 1 98 
17 0 0 98 
18 1 1 99 
19 1 1 100 


TABLE IV 


MEANS, STANDARD DEVIATIONS AND DIFFERENCES IN 
PREDICTED AND ACTUAL 10 SCORES 


Actual Predicted Difference 
Mean 106.0 106.2 0.2 
Standard 
Deviation 9.8 10.1 0.3 


c cc LN 
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14,79 + 30.26 + 70.5 
115.55 


Х, 
Х, 


Rounded off to the nearest hundred, we have stu- 
dent A’s predicted Otis IQ as 116. 

Nevertheless, the task of predicting presented 
another problem. In attempting to predict sever- 
al Otis 1078 it quickly became obvious that such а 
procedure of substituting in a formula was suitable 
with a small number ofcases. On the other hand, 
if this was to be accomplished for a large group, 
it would be much too time consuming. Conse- 
quently, a prediction table was constructed by 
Which the Otis score could be easily determined 
once the D. A. T. Verbal and Abstract Reasoning 
Scores were known. Such a table was completed, 
but since itis large (49 by 47 columns) only a por- 
tion is here reproduced (Table D. 

To test the prediction equation or, as it devel- 
Oped in this case, the predictiontable, a new sam- 
ple of students was selected. An entire senior 
Class of 82 students in the same school where the 
Original correlations were made was used. Again 
from the permanent record cards of these stu- 
dents the D. A. T. Verbal and Abstract Reasoning 
and the Otis scores were transcribedon a master 
Work sheet. From the D. A. T. data the predicted 


Q's were made and compared with the actual IQ's. 


Р While the task of developing and recording the 
igures for the prediction table was tedious, the 
effort was rewarded by the ease with which the 
Predicted scores were made. 
ав Table П represents the findings of this ргосе- 
al ге of predicting scores: the actual IQ scores 
eins. with the predicted IQ scores and the differ- 
it сев between the two. Prior to reading the table 
1$ important to remember that one standard er- 
Tor of estimate for this prediction is 4. 3 and the 
Usual procedure is to expect, by the definition of 
а Standard error of estimate, approximately 90% 
Of the predictions within two standard errors of 
estimate, that is 90% of the predictions should fall 
Within 8. 6 (9) points of the predictedIQ. Table Ш 
shows the number and percent of predictions made 
Within a certain difference. 

From these two tables it is obvious that the 
Predictions are not all perfect. This might well 
be expected; for perfect correlations, while highly 
desirable, are extremely unlikely. Yet, it does 
appear significant when considering the number of 
Predictions that fell within the two standard errors 
of estimate and the reliability of the Otis itself. In 
this case, the prediction table and/or prediction 
equation appear to have some value. 

__Регћарз citing another figure might be more 
illustrative of the value of this pred iction, for 
Studies of such anatureasthis are primarily con- 
Cerned with mean scores and mean values. There- 
fore, when comparing the mean IQ's and standard 
deviations of the predicted and actual scores in 
Table IV, the accuracy of the predictions seems 


more vivid. 

The greatest value, perhaps, of this study lies 
in this table. Itappears almost extraordinary that 
predicting group mean IQ's could be accomplished 
with such a degree of accuracy as illustrated here. 


Summary 


In correlating the D. A. T. Verbal and Abstract 
Reasoning subtests with the Otis, а multiple cor- 
relation coefficient of .82 was found. From the 
multiple correlation coefficient, the prediction 
equation was developed and predicted 90% of the 
cases within two standard errors of estimate. 
While all the predictions were not perfect, the 
Multiple R was almost identical to the Otis relia- 
bility coefficient. In addition, while differences 
in individual predictions were noted, the predict- 
ed mean group 1075 wereas close to the actual 1Q’s 
as possible (106 and 106. 2). 


Conclusions 


From the data presented in this study, several 
conclusions seem appropriate. 


1. Because of the significant and high Multiple 
R (.82) itappears as if the D.A.T. Verbal and Ab- 
stract Reasoning subtest are performing the same 
function as the Otis, i.e., measuring intelligence. 

2. Because of the significant and high Multiple 
R, it appears asif the D.A.T. Verbal and Abstract 
Reasoning subtests can predict Otis IQ scores. 

3. In actually predicting Otis 1075 on the basis 
of D.A.T. Verbal and Abstract Reasoning subtests, 
approximately 90% of the predictions will fall with- 
in nine points. 

4, Predicting mean Otis IQ’s for groups can be 
accomplished with considerable accuracy and con- 
fidence. 


Several implications manifest themselves as a 
result of this study. If the multi-factor tests 
measure the same factors as some Single score 
tests, then it only seems logical that they might 
ultimately replace these single score tests, but by 
the same token if they are measuring the same 
factors as single score tests, why not continue us- 
ing these single score tests? It appears that even 
though the multi-factor tests can, with some de- 
gree of accuracy, dothesame task as single tests 
the rush toward these multi-factor tests should be 
a considered one until the predictive value of these 
tests is subs tantated with proven data. This is 
especially pertinent when considering the amount 
of time that each test takes. The Otis requires 
thirty minutes testing time while the D. A. T. re- 
quires approximately three hours nti 
the time required for directions d хи” 


— | 
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An argument for multi-factor test batteries 
could bethe possible multipotentiality of the multi- 
factor tests, i.e., that the various Subtests could 
be recombined to predict other phenomena. 

What it seems to boil down to is this: that both 
tests tend to measure the same factors, but be- 
cause they measure the same factors it does not 


FOOTNOTES 


1. Donald E. Super, ‘‘The Use of Multifactor 
Test Batteries in Guidance, > Personnel and 


Guidance Journal, XXXV (September 1956), 
с з UH уван 
p. 14. 


2. 1955 Invitational Conference on Testíng Ргођ- 
lems, sponsored by the Educational Testing 
Service of Princeton, New Jersey. 

Robert D. North, TheUseof Multi- Factor Ap- 
-- en actor Ap 


necessarily follow that one should replace the other. 
Test users perhaps should consider some gross 
criteria before дес] ding on a particular test: (1) 
the purpose behind using a particular test, (2) the 
time allotted to accomplish the particular purpose, 
and (3) what the testor and testee are most com- 
fortable in handling. 


titude Tests in School Counseling Proceed- 
ings of the 1955 Invitation Conference on 
Testing Problems, New York, October 29, 
1955, pp. 11-15. 

3. Any standard test on statistics in psychology 
and education provides formulae for these 
Values, 

4. Francis С. Cornell, The Essentials of Educa- 


Чопа] Statistics (New York: John Wiley and 
Sons, 1956), р. 320. 
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AN EVALUATION OF THE LABORATORY IN 


A COLLEGE GENERA 


L BIOLOGY COURSE 


DOUGLAS M. DEARDEN 
University of Minnesota 


THE RESEARCH reported to date in the liter- 
ature indicates that no one type of laboratory 15 
Clearly superior in developing all of the outcomes 
desired from thescience laboratory. The accept- 
able science studies indicate, for example, that 
the conventional type of laboratory has proven su- 
perior to other laboratory me thods in teaching 
laboratory skills, but not in promoting the acqui- 
Sition of facts. 

The research reported in this study was con- 
ducted to hel p determine the contribution of sev- 
€ral types of biology laboratories in a college bi- 
ology course designed for fres h m en and sopho- 
mores who do not plan to major in a science. 


Experimental Treatments 


The four experimental treatments compared 
е (1) individual (conventional) laboratory, (2) 
emonstration laboratory, (3) workbook exer- 
Clses, and (4) a biological term paper. 
ote individual laboratory work was based on 
с Oratory work found in many of the published 
| ере laboratory manuals, and involved a week- 
y two-hour laboratory period. Тһе demonstra- 
tion laboratory required one laboratory hour per 
Week. The laboratory instructors had set up ex- 
periments and materials prior to the laboratory 
Period for the students to observe and study. This 
demonstration type of laboratory eliminated many 
of the more routine aspects of the laboratory work 
for the student and allowed them to concentrate on 
the important points and principles. 

The workbook exercises were submitted week- 
ly and were based on questions in the demonstra- 
tion laboratory manual. The answers to these 
exercises had to be obtained from non-laboratory 
Sources, suchas texts. The biological report 
was used as an experimental control. 

The lecture portion of the class was presented 
by one lecturer who kept the lectures as constant 
as possible for all sections of the class. 


Measuring Instruments 


Three tests were used to evaluate the achieve- 
ment of students randomly assigned to the four 
experimental treatments. The students took these 


tests before beginning the course, upon comple- 
tion of the course, and three months after com- 
pleting the course. The three tests were tests of 
Biological Knowledge, Scientific Thinking, and Bi- 
ological Attitudes. These tests were validated and 
the reliability coefficients for the tests were com- 
puted. The reliability coefficients were computed 
by the split-halves method, using the maximum 
likelihood estimate of the correlation coefficient 
and then correcting this coefficient with the Spear- 
man-Brown formula. The correc ted reliability 
coefficients were 0.87 for the Biological test, 0.86 
for the Biological Attitude test and 0.85 for the 
Scientific Thinking test. 


Experimental Population 


The students involved in the experiment were 
the 924 students who com pleted the one quarter 
General College Human Biology course, G.C. 10-A 
during the 1958-59 ac ademic year. These stu- 
dents were enrolled in the General College, Home 
Economics, Education, Mortuary Science, and the 
College of Science, Literature, and the Arts. The 
students in the six sections of this course were 
randomly assigned in approximately equal num- 
bers to the treatments through the use оға table 
of random numbers. 


Experimental Design 


A stratified randomized design was used in this 
study, with the exper imental treatments being 
randomly assigned over all of the experimental 
material. The students were stratified for col- 
lege and sex prior to being assigned to treatments 
as a means of more equally distributing the stu- 
dents of various abilities. 

The desirable statistical tool totest the null hy- 
potheses of the study was the analysis of variance. 
Of the two main models of this analysis, the fixed 
effects model, Model I, is designed for studies 
such as the present study. T-tests, Duncan’s Mul- 
tiple Range Test, and the Johnson-Neyman Tech- 
nique were also included in the statistical analyses. 


Experimental Results 


A comparison of the initial and final test scores 
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by means of t-tests indicated that each of the four 
treatments had significantly increased student’s 
scores on the three tests. The treatment which 
most consistently increased the variance on the 
final tests was the individual laboratory, which 
suggests that this treatment allowed more ade- 
quately for individual differences in student aca- 
demic ability. 

The overall results from the experiment are 
presented in Tables І, II, and Ш which give the 
means of the tests given at the end of the experi- 
mental period for each of the six sections. 

When the final Biological Knowledge Test scores 
were analyzed by a two-way analysis of variance 
(treatment and ability level) none of thetreatments 
was signifcantly better in increasing student 
scores. Table IV presents the analysis of vari- 
ance results for the lecture section which most 
closely approached significant results between 
treatments on this test. 

The conclusion which resulted from the analy- 
sis of the Biological Knowledge test was that none 
of the four treatments was significantly better in 
helping students learn the factual aspects of biol- 
ову. 

The Scientific Thinking tests were similarly 
analyzed by the analysis of variance technique to 
determine which treatment functioned best in im- 
proving scientific thinking by students. The dif- 
ferences in achievement on this test were not sta- 
tistically significant, indicating that none of the 
treatments was superior in improving sci entific 
thinking by students. 

When the Biological Attitude Test results were 
analyzed, significant differences were found ђе- 
tween the treatment groups in two of the lecture 
sections. The significant differences in section 
one, Fall quarter (Table V) were further analyzed 
by Duncan’s Multiple Range Test, which indicated 
that the individual laboratory group of this section 
had significantly higher attitude test scores than 
the other three treatments. In section three, Fall 
quarter, the workbook group was Significantly 
higher on this test than the other three treatments 
However, the total results do not indicate that any 
one treatment was consistently superior in im- 
proving Student's scores on this attitude test. 

The preceeding analysis of variance results al- 
so were used to determine if any treatment was 
superior so far as being more suitable for stu- 
dents of comparatively high, middle, or low aca- 
demic ability. The chosen criterion of academic 
ability was thestudent'S ACE scores. The analy- 
sis of variance results indicated that the Students 
of a given ability group achieved equally wel] on 
the tests no matter which treatment they were as- 
signed to. Thus none of the treatments proved to 
pe better for the students of any of these three 


ттт SRS лэг erences‏ م 
The technical background for this technique ig p‏ * 


ге А 
ern Statistical Methods: Descriptive Sented in th 


gon, Mod р! and Inductive ( 


comparative levels of ability. 
Analysis of the Follow-up Tests 


Three months after completing the experiment 
al course, arandom sample of students again 22 
the tests used іп this study. Of the 122 poten 
selected, 79.5 percent took the three tests. Т-165 е 
for correlated measures were used to compar 
the final and follow-up test means and пе ри 

The students in all treatment groups (епдес 1 
score lower оп the follow-up tests of Biologici 
Knowledge and Attitudes. However, each 5 dis 
treatment groups increased their scores on 
follow-up Scientific Thinking Test. In nar 
the variances of the final and follow-up tests, С 
individual laboratory group most consistently 5 
creased the variability in scores on the дан poe 
tests, indicating that this treatment allowed то 
adequately for individual differences in те 

A two-way analysis of variance of the [010 
up tests (treatments * number of additional ирж 
courses taken by students) indicated that the о- 
treatments were not significantly different in pr 2 
moting a more permanent learning by students 
biological attitudes, facts, or scientific thinki an 
This conclusion held both for those students V 5 
did or did not take ап additional biology COYS 
after completing the experimental course. 


Johnson-Neyman Technique* 


While no significant differences were found dge 
tween the treatments on the Biological Knowle и 
and Scientific Thinking Tests, some differents 
уеге noted on the Biological Attitude Test. от 
it was advisable to collate the information eaa 
all of the six single experiments to see if une 2 ђе" 
bined data indicated any significant diff erences tion 
tween the treatments. А qualitative modifica t 
of the Johnson-Neyman Technique was use jos 
combine and collate the data from the six $P 
rate experiments, red 

In this analysis, two treatments are compar 
atone time. For exam ple, when the comb re 
demonstration laboratory groups were сот эн оп 
with the combined individual laboratory grou а) 
the Attitude test, the difference (D) in mean finis 
Attitude test sco res between the two treatm 


si 
ma the variance (Sj) of D were computed. 4 
е value of to was calculated, where 
уа wA, 
о gi 
D „јаје 
еее to the table of t, with the approp” an 
я Е 868 Of freedom, the null hypothesis of [d be те“ 
erences between the two groups could 
art Jacke 
е text by Palmer О. Johnson and Robe? 1427-8. 


Chicago: Rand McNally апа Со., 1959)» 
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TABLE I 


MEAN SCORES OF STUDENTS ON THE FINAL BIOLOGICAL KNOWLEDGE TEST 


SS‏ س 


Lecture Section 


ee ЕЕ‏ س 
Sec. 1 Sec. 2 Sec. 3 Sec. 1 Sec. 2 Sec. 1‏ 


Treatment Fall Fall Fall Winter Winter Spring 

“сс es ee ee == eee 

Demonstration Lab. 54.16 50.04 52. 92 50. 64 55.50 48.81 

Workbook 52. 31 52. 06 55. 33 51. 29 53. 36 53.05 

Report 51.44 49.75 53. 67 49. 69 52. 60 50. 42 

Individual Lab. 56. 40 53.13 56. 42 46. 13 52. 83 50. 62 
TABLE II 


MEAN SCORES OF STUDENTS ON THE FINAL SCIENTIFIC THINKING TEST 
سسس‎ ыш Житие 
Lecture Section 


Sec. 1 Sec. 2 Sec. 3 Sec. 1 Sec. 2 Sec. 1 
Treatment Fall Fall Fall Winter Winter | Spring 
ре. a gg — sinc PER ын DEM БРЦ л» АНШИ... Й 


' Demonstration Lab. 32. 04 30. 20 28. 82 30.73 32. 97 34.52 
Workbook 32.19 31.50 31.33 32. 26 33. 41 31.62 
Report 30. 75 31.78 29. 84 30. 00 32. 29 32. 58 
Individual Lab. 35. 53 30. 00 29.75 28. 33 29. 50 35. 90 
шапа a т ао. 
TABLE III 


MEAN SCORES OF STUDENTS ON THE FINAL BIOLOGICAL ATTITUDE TEST 


Lecture Section 


Sec. 1 Sec. 2 Sec. 3 Бес, 1 Sec, 2 Sec. 1 
Treatment Fall Fall Fall Winter Winter Spring 
a re ci, sU 
j Demonstration Lab. 11.27 11.58 11.82 13.73 12.81 12.57 
Workbook 12. 35 12. 56 14.15 13.76 13.23 13.19 
Report 12.00 9.90 13.86 12. 38 14.07 15. 46 
Individual Lab. 15, 87 14,47 12,75 12. 60 13. 22 13. 19 


Uaa‏ ڪڪ 


— = 
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jected or accepted. 

Using this technique to compare each of the 
combined treatments with every other combined 
treatment, the conclusion was drawn that none of 
the mean test scores were significantly higher at 
the .05 level of significance. Thus, at this level, 
none of the treatments was significantly better in 
improving the mean final scores of students on 
these three tests. However, at the .10 level, the 
individual laboratory, when compar ed with the 
demonstration laboratory, was associated with a 
significant increase іп scores on the Biological 
Attitude Test. A summary of the results of all of 
these comparisons 15 presented in Table VI. 

A sample of the experimental groups of students 
was compared with two random samples of stu- 
dents who hadcompleted this course several years 
ago. The ACE scores of these samples of stu- 
dents were compared by an analysis of variance 
technique. The experimental group was not sig- 
nificantly different, with г egard to ACE scores, 
from the students of previous years. Thus the 
results of this experiment can be generalized to 
that population of students taking the experimental 
course, assuming that the characteristic of future 
students do not change. 


Summary 


Four experimental treatments (individual or 
conventional laboratory, demonstration laboratory, 
workbook exercises, and a biological term paper) 
were statistically evaluated in a college general 
biology course. Tests of Biol ogical Knowledge, 
Scientific Thinking, and Biological Attitudes were 
used to compare the achievement of students ran- 
domly assigned to these treatments. 


A significant increase intest scores was а550- 
ciated with each treatment at the end of thein- 
structional period, butnone of the treatments was 
superior to the other tr eatments in increasing 
student’s scores. Each of the treatments allowed 
for individual differences in the academic ability 
of students, but the individual laboratory allowed 
more adequately for these differences in learning 
ability and thus allowed the students to learn at a 
rate more consistent with their ability. 

In comparing the test results of students with 
comparatively high, middle, and low academ ic 
ability, none of the treatme nts was found to be 
more suited to students of a par ticular level of 
ability. 

At the .10 level, the Johns on-Neyman Tech- 
nique indicat еда difference between two of the 
treatments. When the individual laboratory and 
demonstration laboratory groups were compared, 
the individual laboratory groups had significantly 
higher mean test scores on the Biological Attitude 
Test. 

None of the treatments was significantly sup e- 
rior in promoting a more permanent learning of 
the course materials, as measured by tests ad- 
ministered three months after the students had 
completed the experimental course. 

The results of experimentation can provide use- 
ful practical results. Based on the results of this 
experiment, a combination of these methods could 
be used to good advantage. The advantages of all 
four treatments could be incorporated into acourse, 
with particular emphasis being placed on the indi- 
vidual laboratory because of its better adjustment 
to individual differences in learning ability of stu- 
dents. In addition, if the four treatments were 
carefully integrated into a course, student interest 
in the course materials could be increased. 
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APPLICATION OF MATRICES IN THE ANALY- 
p SIS OF SOCIOMETRIC DATA 


JOHN X. JAMRICH 
Michigan State University 
East Lansing, Michigan 


` A NUMBER OF research workers in the field a11 a12 b11 b12 
( of sociometry have clearly identified the inad- а) My = and Мә- 
| equacy of the ordinary methods used in the analy- a21 a22 boi Бог 
| sis ОГ sociometric data. Briefly, the inadequacies 
stem from the fact that ordinary methods of anal- is given by 
ysis do not dete rmine, uniquely, status levels of 
the individuals or of sub-groups of individuals in aj] + P11 ауд + Ъ12 
the group being studied. Furthermore, present (2) М+Мо- 
methods lack specific quantitative characteristics ag, + boi agg + bog 
to permit analysis of change in the social structure 
of a group, іп the relative status of an individual The product of two matrices, W ith elements as 
У within the group, ог of the sub-groups, usually re- above, is defined as: 
ferred to as cliques. 
Katz* has presented a mathematical derivation (3) a11 611 + 212 іі ап 512 + 212 099 
of a “status vector” of a given group. The deri- M4 ‘Maz 
vation of the numerical elements of this vector 15 a21 P11 + 222 bo; a21 P12 + 322 9992 


а relatively cumbersome process, involving the 
в solution of сто" simultaneous equations, where ‘‘n’’ Thus, if we have a matrix 
is the number of individuals comprising the group. 


It is the purpose of this paper to present the de- 011 
velopment of a relatively simple numerical char- 
acteristic for sociometric data which will permit: (3a) Му = 100 
1. determination of the status of an individual 110 


within the group, taking into account his choices 
and the kinds of choices by others in which he | then, Mj would be M, multiplied by itself, namely: 
is involved; 
ЭР” 011 DE 210 
2. determination of a characteristic vector Тог the а 
entire social group so that it may be compared (4) Му = 100 Е 100 = | oad 


with other groups and with sociometric data for 


the same group at a later date; 110 110 111 


3. the identification of cliques, and their relative and 
status within the group, and changes which may 
be taking place from time to time as new soci- 011 201 291 


оте ле data are gathered for the same group. 3 
(5) мМу+М1= | 100| «| O11) = 111 


І The understanding of the development and сот- 
putation is based on the simple concept of addition 110 111 Tn 
and multiplication of matrices. By definition, the 


у sum of two matrices: For later references, it should be pointed out 


*Leo Katz. “А New Status Index Derived from Sociometric Analysis," Psychometrika, ХУШ (March 
1953), рр. 39-43. 
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that the sums of the columns of the matrix Мі» 
Мі аге 
(6) (5, 5,3). 
і imple 
sider, as an example, the following simp 
“ээж inv olving three individuals, 41, Ао, 


and A3, each individual having made one or two 
choices of equal “strength. > 


Аз 


Е " Е" 


Ag 
The matrix Tepresenting this Sociogram will ђе: 
Ay Ag Аз 
А 1 1 
(8) My = 4100 


Ам 1 9 


be Precisely 


n 
the one used ав ап illustration in (3a) 


NS multiply the matrix in (8) by іп (8 wel шиш 
210 

(9) Mj = 014 
1211 


И we consider М1 to be a com munication ша- 
trix, then іп the square of it, мг 
ith row, jthcolumn Bives the ny 
communications between Ai and Fo 
ate once with A1 in 
while A2 cannot Communicate With Aq in 
We might Say, then, that th 
number of two-stage relati 
ponding individuals. 
Now forming the Sum, Му + y? 
matrix (10) shown at the top of colum 
е; 
E the interpretation of ( 


: 8) and (g) 
that the entries in (10) repres 


» We see 
ent the umber of 


221 
The column sums of 


00 му+му= 111] (0) ave (5, 5, 3) 


221 


vidual within the group, 


ката 
terpreted as “оцбуаг4” one- and two-stage rel 
tions or communications. ) 


Now, а consideration of the matrix in (10) and 
(ће ori ginal 


facts, First, the diagonal of the matrix in (10) 


Thus, according to 
» A1 ahd A2 have equal 


Sing this same £roup of three individuals, let 
i Which the column-sum 


For this purpose 
quent application of ше 
following sociogram: 


Аз 
^" "ati 
Аҙ 
Whose Matrix is, 
011 
M = 1:03 
110 
Тће Square of this Matrix 18 
211 
(12) 1 
Mî = 121 
1123 
and the sum 


tia 
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мізмі- | 2 2 2 


The column-sum vector is (6,6,6). In other 
words, compared with the column-sum vector of 
the sociogram of (7), it is evident that all three of 
the individuals enjoy two mutual choices, as re- 
vealed by the diagonal values in(12), and, they are 
all of equal status. 

Remembering that these illustrations assume 
two choices for each individual, and two choices of 
equal strength, let us illustrate the utilization of 
the method with a more complex group, Say, of eight 
individuals. First, let us consider the following 


sociogram for these eight: 


Ag А5 


The matrix for this sociogram would be 


01010000 
10000001 
00010010 

(13) C= 10000010 
0.0 0:0. 0 1. 0 1 
01001000 
00110900 
й0001190 

and the square of it is 
20000011 
01011100 
10110010 
2 01120000 

ы С = | 01002100 
10000102 
10010020 
01001101 


The sum, С + С? is shown at the top of the next 
column, and the resulting column-sum vector is 


(16) v. (7,6, 3,8, 6, 6, 6, 6) 

(Ay Ад Аз Ад АБ Ag Ат Ag) 
From the values of the diagonal in (14), it is 

clear that Ај, Ал, Аб, and Ат each have two mu- 


21010011 
11011101 

10120020 

à 11120010 

(15) саб = | 01002201 
11001102 

191200020 

01002201 


tual choices to their credit. From the character- 
istic column-sum vector, it is seen that А4 has 
the ‘‘most’’ status, which fact can be substantiat- 
ed from the original sociogram, 51, for the pur- 
poses of this discussion. Similarly, A1 is seen to 
have the next highest status designation, followed 
by А5, Аб, Ат and Ав. The latter four are indi- 
cated by the vector sums as being of equal status, 
a fact which can again be checked in the original 
восторгат, 

It may ђе worth noting that the column-sum 
vector values of Ај and Аб аге 7 and 6 геврес- 
tively. From the sociogram, one would conclude 
that these two individuals have equal status on the 
basis of the fact that each enjoys two mutual 
choices. The values of 7 and 6 in the column-sum 
vector indicate the fact that Ај has one more two- 
stage ‘‘inward’’ relation inthe group than does A5, 
a fact which may be of some importance in the 
inter-group relationships. 

The interpretation of the column sums might 
be illustrated at this point by showing the meaning 
of the number “77 which is the value associated 
with Ај. Theseven “Чругагд” one- and two-stage 
relations for Ај are as follows: 


A Age— Ag 
Ар--Аа---А1 
A Аде—— А 
A1€—— А4<—— Аз 
A  Age—— Ат 
Ay А 

Av A4 


By using the characteristic column-sum vector 
to identify the individuals with the most status 
and using the original matrix, we are able to ident- 
ify the cliques in this group and associate with 
them а status designation by using the column- 
sum vector for each individual in the clique. 

For example, from (16) we see that A4 has the 
highest column sum. From the diagonal entry in 
(14) we see that A4 enjoys two mutual choices 
which, from (13) are with Ај and Ат. Hence, A4 
А1, and Ат can be said to form a clique. i 

Similar analyses yield the following cliques, 
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the first three of which actually constitute a large 
Sub-group: 


(17) 


Now consider the same eight individuals with 
the following Sociogram at Some later time. 


a бес с> А 
27 
Аз 
Aq қ 
% Lo А, 
A6 


The matrix representing this Sociogram is 


01000001 
100004100 
90:6 13 9 9 5 
18 D = 2041040 $1 
(48) 00110000 
00100010 
01100000 
11000000 

The square of this matrix is 
21000100 
01100011 
00211000 
а _ 20131000 
(19) в 08115900 
911110006 
50011400 
11000101 


- 22000101 
11100111 
00222000 
00222000 

(20) Паре 00222000 
01211010 
11111100 
22000101 


Тће characteristic Column-vector for (20) is 
(21) (6, 7, 10, 8, 8, 4, 2, 8!) 


and the Cliques, у 


ith their characteristic column- 
sum vectors, are 


e 


2 


wi 


4. 
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A TABLE TO FACILITATE THE CALCULATION 
OF NORMALIZED T-SCORES FOR SMALL 
SAMPLES (N>5<51) 


PETER F. MERENDA 
Walter V. Clarke Associates, Inc. 
East Providence, Rhode Island 


SEVERAL AUTHORS of psychological meas - 
urement and statistics texts, Garrett (1) and Ed- 
wards (2), for example, have included in the ap- 
pendix a table to facilitate the calculation of 
T-scores. The use of this table, however, re- 
quires the performing of all of the steps in the us- 
ual procedure for normalizing distributions and 
calculating T-scores. 

The table presented in this article requires 
only that the first four steps in the usual procedure 
be performed. The final step of calculating per- 


centages, whichis the most time-consuming of all 
the steps, before entering the T-score table is 
eliminated. In order to enter this table, all one 
needs to know are the number in the sample and 
cumulative frequency below a given score plus one 
half of the frequency of the given score. The ta- 
ble is divided into two parts: Table I for N’s be- 
tween 6 and 24 inclusive, and Table П for N’s of 
25-50 inclusive. The procedure and instructions 
for use with this table are given below with ex- 
ample: 


INSTRUC TIONS FOR USE WITH T-SCORE TABLE 


Step Procedure 


1. Prepare frequency distribution of scores. 


2, Cumulate scores, i.e., add each successive row of scores. 


3. Tabulate the frequency below a score + 1/2 the frequency of a given score. 


4. Enter T-score table with N (number in sample) and read standard normalized 
scores in the appropriate cell for values calculated in step 3. 


Example: 
N=10 
Cum. f. below T-scores (Read 

x f Cum. f. +1/2 f from chart) 
100 1 10 9.50 67 

90 2 9 8.00 58 

85 3 7 5.50 51 

80 2 4 3.00 45 

15 1 2 1.50 40 

10 1 1 .50 33 
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TABLE I 


A TABLE FOR CALCULATING T-SCORES 


(N > 5 <25) 
21/24 N Cum.f, d. 
ж. ЖЕ д +1/2f N +1/2f 
257 9 233% 0.5 3% + 5 x 0 
23-28 1.010 8 19 
22 6, 67 22 à 1 10 
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МЕВЕМРА 
TABLE II 
A TABLE FOR CALCULATING T-SCORES 
(N > 24 < 51) 
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31 31 31 313 
28 28 28 26 2 
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Note: N is encircled figure. 


256 


This table should prove to ђе of value to col- 
lege instructors and classroom teachers who de- 


sire to normalize distributions of scores of 


classroom tests and report scores as normalized 


T-scores. This table may be extended indefinite- 
ly, but it is felt that the maximum N of 50 will 


suffice for the great majority of classroom situa- 
tions. 
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ESTIMATION OF TEST RELIABILITY BY 
ANALYSIS OF VARIANCE TECHNIQUE’ 


CYRIL J. HOYT and P. R. KRISHNAIAH 
University of Minnesota 


Introduction 


IN AN EARLIER paper, Hoyt! showed а method 
of determining the best linear estimate of the er- 
ror variance appropriate for expressing an inter- 
nal consistency reliability coefficient for a total 
test score obtained by adding the scores on items 
scored either one or zero. In this derivation it 
was assumed that the рд errors of measurement 
in the responses of p students to eachof the q items 
were independent. Later investigation showed 
that the procedure described in this earlier paper 
is entirely appropriate when: 


1. items are scored in more than two categor- 


ies“, 
2. “Цешв” are ratings?, test scores? or other 
measures?, 


Further theoretical development in the area of 
components of variance estimation has enabled the 
authors to showthat certain assumptions regard- 
ing the independence of the errors of measurement 
and of independence of main effects (individuals 
and items) are not necessary in order that Hoyt's 
1941 formulation of the reliability coefficient es- 
timate yield appropriate ratios of expected values 
of the mean squares. 

When item effects are assumed to be intercor- 
related (see models 5 and 6) the expected values 
of the two variance components (A) and (C) used in 
estimating the reliability coefficient are precisely 
equal to those obtained for models 1, 2, 3, and 4 
in which independence is assumed as in the 1941 
derivation. 

When errors of measurement are assu med to 
be correlated for the responses ofa given individ- 
ual on different items (see models 7, 8, and 9) the 
expected values of both variance components A 
(among individuals) and C (error) are different 
from those obtained for the first six models. How- 
ever, the ratio of (A-C) to A still provides а suit- 
able estimate of the internal consistency reliabil- 
ity coefficients provided the user recognizes what 


he has. 
ee aaa 


* All footnotes will be found at end of article. 


The definition of fixed, mixed and random mod- 
els follows: 

The model equation for each of the nine lines of 
Table Iis Xjj= p+ Ci + 8j + €jj where 


where р, съ Bj are the general effect, ith indi- 
vidual effect, and jth item effect. € 1/5 are errors 
which are normally distributed. This equation is 
suitable for all three types of models with differ- 
ent assumptions on Ше oc’s and 8'5. We may sup- 
pose that the individuals and the items are drawn 
randomly from their corresp onding populations 
which contain P individuals and Q items respec- 
tively. 


A. Fixed Model 


When Р = p, and 0-4, then the above equation 
represents a fixed model. In this case, the only 
random element is the error term Eije When de- 
scribing the reliability coefficient for a given test 
form administered to a given group of individuals 
the fixed model is appropriate. 


B. Mixed Model 


When P = p and Q > q or when P > p and Q= 
q, then the equation represents a mixed model. 
(The symbol > is used to designate the relation 
as much greater.) This model is suitable for con- 
sidering the reliability obtained as an indicator of 
that appropriate for either measuring a given 
group of people on a domain of items represented 
by the sample of q items or, alternatively, for 
generalizing to a population of P subjects for any 
given administration of a certain test of q items. 


C. Random Model 


When P > p and Q > q then the above equation 
represents a random model. This model is suit- 
able for estimating the reliability coefficient for 
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TABLE I 
EXPECTED VALUES OF MEAN SQUARES FOR NINE MODELS 


—_ = ——— —— ————— سے‎ 
Model (A) Among Individuals (B) Among Items (C) Error A-C 
1. Fixed model 
Individual item effects fixed о? +-Ч уа? +P og o 4 ха 
Errors uncorrelated р-1 1 qj р-1 
2. Mixed model 
Errors uncorrelated 
Item effect random o +-9 хай 2 2 а 
Individual effects fixed No 9 «рер са pi?*i 
3. Mixed model 
Errors uncorrelated 
Item effects fixed о? d 
Individual effects random us ghe d ЕВ} o? дос, 


4. Random model 


Errors uncorrelated 
Item effects random 


2 2 
Individual effects random ши 


02 + po? 2 2 
ров 5 406 


Random model 

Errors uncorrelated 

Item effects correlated о? + qo? 2 
Individual effects random ыг v + p(1 - о) ор o? IG, 


6. Mixed model 


Errors uncorrelated 
Item effects correlated e? 4-5 у, 


Individual effects fixed р-І с? + pt - p) 98 o? 4 ха 


T; 


Fixed model 
Errors correlated 
on different items for 


2 
с [1 + (а- 4 2 
same individual Па 901 рл Хау 


2 р 2 
° 1-1) + тв? 0201 - о) зат Pod + 09 


8. 


Mixed model 
Errors correlated as in 7 
Individual effects fixed 


v [1*(a- 1)0] +4 zo? 
Item effects random )o] + p-i?9i 


PU - о) + род 01 - р) 


9. 


Random model 
Errors correlated as in 7 ‘ 
Item effects random ce [1 + (4 - йә] + ай 


Individual effects random (1 - 5) + рођ (1 - p) 


са 


НОУТ. 


ап infinite population of subjects or a finite one с 
size P, on the whole domain of Q items. 
m ce oi 2, 3, and 4of Table I show the expect: 
АНЬ ues of the mean squares components in nin 
rent models that are each specified by the as 
sumed characteristics listed in column one. Thes 
nine models are all useful for specifying interne 
consistency reliability coefficients if Xij is con 
sidered the score earned by i - th individual's re 
Sponse to the j - th item on an examination. 1 
these nine models the interaction term is used a 
error as is usually done in cases providing onl 
one observation рег сей. By making different as 
sumptions about the parameters in the equation tl 
nine different models each described in succes 
sive lines of Table I arise. There are а пиши 
of other models which seem useful and for whic 
we have not yet studied the variance componen 


and reliability estimates. 
The first six entries i 

ble Iare the expected val 

of the true scores ог the numerator of the reli 

bility coefficient appropriate for each line of t 

table. The denominators of the reliability coeff 

the entries in column 


last three lines, tl 
m to contain то 
the true scores. Henc 


n the last column of T: 
ues of the mean squar 


true scores. 
x us consider now the first four models 
ich all the effects represented by the terms 


the equation are independent. In these cases, € 
edure for estimating 


notes that th e usual ргос 
icient gi 


internal consistency reliability сое ff 
tio of the variance 


саша estimates of the га 
e true scores to the variance of the obtain 
Scores, The last column of Table I gives only 
тін of the reliability coefficient obtained 
page one minus the ratio of the error mean squi 
to the among students’ mean square. The deno 
me of the reliability coefficient is (A) of 
orresponding row or the mean square for аш 


individuals. Rows 5 and 6 show eXP ected val 
or random and mixed models, respectively, 
ated with one. 


Which the item effects are correl 


- KRISHNAIAH 956 


Ї 


other to the same extent (о) for each pair of items. 
The column indicating the expected value of the 
‘among items” mean square gives an expression 
involving this common correlation о betweenitems 
However, as long as the errors of measurement 
in the several items are considered independent of 
each other and independent of the item and indi- 
vidual effects, the usual mean square ratio for 
estimating the coefficient of reliability gives a nu- 

merator which is an appropriate estimate of the 

variance of the true scores. 

In models 7, 8, and 9, however, the numera- 
tor of the usual estimate of the reliability coeffi- 
cient contains а term oqo? which is not zero un- 
less o is zero. In these three models, it is а5- 
sumed that the errors of measurement on pairs of 
jtems are correlated with a common co-variance 


for each individual. 


FOOTNOTES 


* Work on this paper was partially supported by 
the Bureau of Educational Research, Univer- 


sity of Minnesota. 
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ON TEACHING F INSTEAD OF 1 


NORMAN H. ANDERSON 
Yale University* 


IT IS WELL known that the square of at ratio 
on n df is interpretable as an F ratioon 1andn df. 
For the case of oneor two means then, the values 
of t and F are equivalents, and lead to identical 
probability and inferential statements. Since F is 
the more general, applying as well to the case of 
more than two means, one might well suspect that 
the t test could be retired from use. Because the 
t test constitutes sucha large part of current sta- 
tistical pedagogy and practice, it may be useful to 
consider the matter in more detail. The main 
considerations which seem tothe writer to be rel- 
evant are summarized briefly below.! These con- 


Siderations are applicable principally to the first 


few statistics courses in the applied sciences and 
have less relevance to more theoretical courses. 
In favor of t, it canbe argued that it has great- 
er intuitive appeal. То measure the difference 
between two means against its standard error may 
be simplerto comprehend and result in easier 
learning than does the variance partitioning tech- 
nique of F. However, as soon as it is admitted 
that t is insufficient to meet everyday needs, 50 
that F must be taught anyway, this argument loses 
its power. It may also be suggested that whatever 
greater immediacy of meaning may attach to t is 
Not entirely its natural due, but rather а product 
of the present organization of statistics courses. 
The t test may also be preferred because of its 
i plems arising in esti- 
f confidence in- 
it makes little 
ceeds from t orfrom F. 
nfidence interval 


em with F, the 95 percent сот ا‎ 
Obtained usi 32. F(. 95; 1, n). e in = 

ng Vem ЁС 2282) could then be tak- 
еп on faith (and even serve us efully as the basis 


ar the signifi stforthe case of one or two 
means), dumm. d s t be admitted that the 
Confidence interval idea seems easier to compre- 
ind When it is based on the standard picture 1 
vi € symmetrical t distribution. This possible E 
a of t must be measured against its 5 а, 
ee in problems of significance testing whi 
те discussed below. 

A further point in favor of t st 


sas OOT 


* " 
Footnotes will be found at end of article. 


emsfrom à dan- 


ger of the analysis of variance. Ordinarily, when 
pair-wise comparisons among several conditions 
are to be made, the error mean square from the 
overall F analysis is used throughout. When the 
condition variances are reasonably equal, this 
procedure gives increased sensitivity since the 
erroris estimated more precis ely. If there is 
marked heterogeneity of variance, however, such 
tests may be biased, and it may be preferable to 
test each pair of conditions using only the data 
from that pair (although this does not entirely es- 
cape the difficulty). Of course, the two condition 
F could be used here as easily as t. Neverthe- 
less, it must be admitted that routine application 
of the analysis of variance does facilitate over- 
looking the problem. 

Against these arguments must be set a number 
of negative aspects oft. For situations with just 
one or two conditions, of course, it is a matter of 
statistical indifference whether tor Е is used. 
Either yields the same inferences, as noted above. 
Moreover, since Е-Е, there need be little differ- 
ence in computational efficiency. Itmay be noted, 
however, that the standard computing rules for t, 
in contrast to those for F, are not efficient and 
are thus more liable to arithmetical error. 

When more than two conditions are involved 
piece-meal t testing presents certain danger 52 
Тһе опе commonly emphasized is the increased 
risk of thereby obtaining “significant?” differences 
which the data do not in fact support. Curiously 
enough, the t test approach also increases the 
likelihood of the obverseerror, namely, failing to 
detect results which the data are quite adequate to 
declare real. This can happen in a number of 
ways, which trend tests may serve to illustrate. 
In investigations in which one of the independent 
variables is numerical, the between conditions 
variation can often be expected on prior gr ounds 
to reside largely in the linéar, or linear + quad- 
ratic, component of the curve. Pair-wise com- 
parisons of means may fail to reveal even a fairly 
strong real effect which a trend analysis F will 
elucidate with ease. 

A more fundamental objection to the t test ap- 
proach concerns the original planning of the inves- 
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igati researcher who thinks in terms of 
a Lee to use as incisive and informative 
a design as canbe had for the allotted time and ef- 
fort. Even when the basic plan is sound, a poor 
superstructure can, and not i nf requently does, 
prejudice the analysis and interpretation, or en- 
tail considerable extra computational cost. | 
Fisher’s commendation of the use of factorial 
designs for controlling extraneous variation may 
be cited in illustration of this last point. Even 
with only two main experimental conditions, it is 
often desirable to counterbalance a second vari- 
able, not of great interest in itsel f, in order to 
obtain added generality. Unless set up so that the 
two-way analysis canbeand is used, however, the 
counterbalancing may do more harm than good. 
In a similar way, it is often possible to use as 
a second independent variable some individual dif- 
ference measure which is correlated with the re- 
Sponse measure under study. This procedure, 


ocks” factorial de- 


; that F alone should be taught. To 
two sets of rules where 
ident use of the Student’s 


For statistics courses in which time is insuf- 
ficient for both tests, the 


complish the same ends as the choice of t. 


permit the development 
ramifications of the analys 
two-way designs, specific 
tests. Moreover, teaching F from th 


er learning 
Ount of nega- 


ilable to the writ- 
er, admittedly anecdotal in nature, Suggests that 


Students who have learned t are reluctant to aban- 
don this skill when necessary. Havinga technique 
which may actually, if not always advisedly, be 
used, they tend to regard even workaday designs 
as “Тапсу” or “‘not worth the trouble at this stage 
of investigation. ’’ Thus, if they us 
all, they do so only with pain, and so fail to ac- 
quire the familiarity with the new technique that 
would come were it regarded as just another ap- 
plication of the F test they already knew, 


€ the design at 


It would thus appear that t is superfluous, since 
F may always be computed instead; inadequate, 
Since it does not apply as gener ally as does F; 
dangerous, since its use may increase errors of 
inference, as well as lead to weak experiments; 
and a waste of time, since teaching F will provide 
more transfer to later learning. On the balance, 
it seems fair to conclude that the t test should be 
struck from the curriculum, 


More practical experience is neededin order to 
integrate the anal ysis of variance with beginning 
Statistics on the conceptual level. However, it is 
appropriate to sketch briefly the sequence of com- 
putational routines which is indicated by the argu- 
ment of this note. 

1. Computation of the sample variance is done 
using the analysis of variance table: 


Source df 55 MS 
Е, 2 ЗЭ? 
Меап 1 T?/N 

Error 


(Variability) N-1 >Х2-тТ?/м 


The entry, MSerror, then furnishes an unbiased 
estimate of the population variance. The scaffold- 
ing provided by the tabular format is unnecessary, 
of course, but seems to Serve a useful mnemonic 
function. Its real benefit accrues in Step 2. 
ificance for a single sample 
carried out on difference 


3. With two samples of difference scores, the 
analysis of variance table is extended one steP 
further: 

Source df 55 MS Е 
Меап 1 

Between 

Groups 1 

Error N-2 


a И 


4. It is str aightforward to extend Step 3 to Ше 


ase of several samples and it seems advisable 10 
do so promptly in order to avoid the developmen 
ofa Superstition that the technique only works aea 
one or two. The routine remains the same 
the scores are arbitrary, rather than ери 
Scores. Itis only necessary to observe Шау 
F for Mean is no longer meaningful or impor 
and may be omitted from the table. 


SS 


% 
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5. A simple extension of the computing routine 
yields the calculation of r for a single sample: 


Source хх ХҮ FE 


Mean Ty/N TxTy/N Ty/N 


Error EZX'-Ty/N ZXY-TyTy/N ZY'-Ty/N 


The square of the correlation coefficient is esti- 
mated from the entries in the Error line, r? = 
SSyy/SSxxSSyy- The test for significance may 
be made by consulting a table for r or, with an 
additional calculation, by using F. When the cal- 
culation of r is taught this way, there is less like- 
lihood of confusedly agglomerating the between 
and within samples regressions when more than a 
Single sample is involved. 

It is suggested that ZX? be given for most of 
the homework problems. This will greatly de- 
crease the time spent on each problem and, by 
thus allowing a greater variety of problems to be 
assigned, increase thelearning efficiency. Wheth- 
er or not further applications of the analysis of 


variance are given in the first course will depend 
on the available time, and on whether the course 
is oriented toward experimental or observational 
research. 


FOOTNOTES 
* Nowatthe University of California, Los Angeles. 


1. For the purposes of this note, the terms SEHR 
test" and ‘‘analysis of variance" will be taken 
as synonymous although this is not strictly cor- 
rect. It will be necessary to allude to a num- 
ber of uses of the analysis of variance, but 
these uses are well known and will not be doc- 
umented. 


2. A specific comparison based on some reason- 
ably firm rational or empirical ground may al- 
ways be tested, of course. However, unless 
itis based on a single degree of freedom, it 
wil require an F ratio in its own right. It 
should also be specifically noted that multiple 
comparison procedures will be superior to F 
(and t) in certain situations. 
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COMPUTATION OF VARIANCE ACCOUNTED 
FOR IN MULTIPLE CORRELATION 


CLINTON I. CHASE 
Idaho State College 
Pocatello, Idaho 


WHEN MULTIPLE correlation techniques are 
used in the analysis of data, the investigator usu- 
ally wishes not only to discover a combination of 
variables which will yield maximum prediction of 
а criterion, but also he wishes to determine the 
amount of variance in the criterion which is asso- 
ciated with each of the independent variables. For 
this purpose Engelhart(1) has 5 ubmitted the 
formula: 


R? = 85, + Boz +.. . Bon + 2801802712 + 
Вол Возта з +. +“ 280, n-1BonTn-1,n [1] 


Following this procedure, the ithbeta squared will 
Produce the variance directly associated with the 
ith variable. But since the ith variable is usually 
correlated with other independent variables, ital- 
80 enters into variance which is shared by several 
of these other independent variables. The total 
amount of this shared variance is given by the 
Summation of all of the terms of the form 280180) 
Tij which appear in formula[1]. Then, according 
to Engelhart's procedure, the shared variance 1s 
divided among each of the independent variables 
11 the same proportions 45 the direct variance. 
The total amount of variance associated with the 
ith Variable, then, would be the ith beta squared, 
Рива proportional amount of the total shared 
Variance, 

Since Engelhart's procedure for partitioning 
he shared variance is admittedly arbitrary, the 
Purpose of this paper is [0 present an alternate 
Procedure, which, itis believed, is somewhat less 
arbitrary in its method of аррог tioning shared 
variance, since here the basis for partitioning the 
Shared variance is found in mathematical manip- 
ulations, 


с- 
As а necessary partof nia Бе. 


this presentatio 


в = 
R? = Вол Го! + ВогГог + ВозГоз +- -Bonton 


[2] 


Using this procedure, one may find the total 
amount of variance associated with the ith vari- 
able in the product goiroi. Although form ula[ 2] 
is simpler to use thanformula[1], especially when 
the number of independent variables is more than 
two or three, it has the disadvantage of not yield- 
ing a break-down between variance which is di- 
rectly associated with the ith variable, and that 
portion of the variance shared among the several 
independent variables which is to be associated 
with the ith variable. 

It therefore seems that some technique is de- 
sirable which would retain simplicity and economy 
of effort, while yielding a maximum amount of in- 
formation. Such a system is found in the mathe- 
matical analysis of formulas Шала [2] as shown 
below. 

Inaset of norm 
square estimate oft 
ear-regression equation, 


al equations for the least- 
he betas for the multiple lin- 
the r's are found as fol- 


lows: 
гол = Во: + Гл гВог + Га зВоз + Fia Boa 3 
To2 = Boz + гл 2Во1 + Газйоз + Г24 Во4 4 
гоз = Воз + Га зво: + ГазВог + ГзаВо4 5 
ro4 = Воз + га во: + Г24 Вог + Гза Воз [6] 


Therefore, the right hand portion of [ 3] can be 
substituted for го, in formula[2], the right hand 
side of [4] can be substituted for гог in[2], and 


во оп. The result would be as follows: 


В? = Во: (Во: + Гъгвог + Гъзвоз + rı4 Boa) + 
Вог (Вог + T1280: + T23Bos + r24ß04) + 
Воз (Воз + T1380. + гозВвог + ГзаВо4) + [7] 
Во (вод + ГаВо: + Г24Вог + F34 Bos) 


ond technique (2:409-10) for computing the vA 
За the criterion associated with any Lye И Multiplying to eliminate the parentheses one may 
ni lent variable must be noted. Suc find the following results: 
Que is provided in the formula: 
ыг : с received from В. Е. Jarrett of the Department of Psychol- 
he author wishes to acknowledge the > d. 


98y, University of California, Berkeley, 
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+r + 
R? = 80, + глгВогво, + Г1зВозВо: Ж S и 
В г + Г2Во1Во2 + ГгзВозВог + r24 Во4Вог [8] 
Вбз %ГізВо.Воз + ГазВо2Воз + F34 Во4Воз + 
864 + Гм Во: Bos + Гоа ВогВо« + гзаВозВог. 


ow be seen thatsumm ating like terms in 
[5] will produce formula[1] as it would be written 
in a four variable problem: 


2 2 2 жы + 

R-96i + Вог + Воз + Boa + BoiBozria 
кы + 2Во:Во4г14 + 2Во2Возгоз + [9] 
2ВогВо4Гг4 + 2ВозВочгз 


it is apparent 
variance as- 
as given by the Boiroi 
is actually equal to the beta 
of the summation of all the 
formula [9], of the form 


tributed in the same p i 
directly associated 


. 0f computation 


hart proposed. Noting the relationship dap 
formula [1]andformula [2], one may now Meg en 
both the variance whichis directly associated wi d 
the ith variable, andthat portionof variance share 
among the independent variables which is to be iaS= 
sociated with the ith variable, by using the sim- 
pler procedure as given by formula [2], with s: 
addition of a further, simple computation. First, 
the Војгој product is found, yielding the total (a 
rect and shared variance associated with the it 
variable. Then Во; is Squared, producing the var- 
iance directly associated with the ith variable. 
This figure is now subtracted from the Воігоі pro- 
duct, the difference being that portion of the Wards 
ance, shared by the independent variables, which 
is associated with the ith variable. 

It is believed that this method provides two ad- 
vantages over the procedures represented by either 
formula [1] or [2] alone: first, it yields maximum 
information concerning a given variable’s partici- 
pation in the variance accounted for in the criter- 
ion; and second, it materially reduces the number 

5 necessary to secure that amount 


of information 
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ON A TEST OF WHETHER TWO SETS OF OB- 
SERVATIONS BELONG TO A CERTAIN CLASS 
OF DISTRIBUTIONS WHEN THE RANDOM 
VARIABLES ARE NOT INDEPENDENT 


H. P. KUANG* 
University of Minnesota 


THERE EXIST several techniques which have 
been developed in recent years to test the hypoth- 
esis that two samples have been drawn from the 
Same population. These developments were based 
on signs, ranks, or the randomization principle. 
For instance, to test the hypothesis that the two 
Samples are from the same population, Mood con- 
sidered a test based on signs while Dixon pre- 
sented a statistic based on ranks. Utilizing the 
randomization principle, Pitman prop osed the 
difference of sample means asa criterion, large 
values being significant. Wald and Wolfowitz de- 
vised the total number of runs in a sequence for 
testing the same hypothesis thattwo samples (not 

ecessarily of equal size) have been drawn from 
a common population. The tests developed are 
essentially limited to the continuous and independ- 
15 variables. We shall, however, treat the prob- 
em without these restrictions. It is the purpose 
9f this paper to present a probabilistic inequality 
Which may be used to determine whether two vari- 
ables are contained in a certain set of distribu- 
tions without theassumptions of continuity and in- 
dependence. 

Let us consider the two random variables X 
and Y which аге not independent. Suppose that 
bo of the variables is divided into а finite num- 
Рег of parts А, ----- Ay, and let the correspond- 

ng discrete marginal distribtuions of Х and Ү be 


Е, = {р i tivel 
рі] and F; = {qi} ( = 1 -- г) respectively, 
80 that pj = P(X = a ai = P(Y- Aj) and 
F r 1 
> рі-і1 > ат 
ій і нан і 


"X further, the cor responding observations x 

1 Y be {ni} and {mi} (i21---r) respective- 

a 50 that the sample values ni and mj belong 
the set А; and 


* - 
The author is currently Professor of Mathematic 


Ollege in Greensboro, North Carolina. 


Е . T 
E п-п, Х mj-m. 
ip bc 07 


Апу set of Ај carries themass nj inthe distribution 
n 
of X and the mass mij inthedistribution of Y. The 
m 


object here is to decide on the basis of two sets of 
observations (пі) and (mi) (i = 1 --- г) whether 
the two distributions Е, and Е, belong to the same 
class of distributions. 

Using the Minkowski inequality, the author has 
proved in his unpublished dissertation (University 
of Minnesota) that if the two distributions F, and 
Е, belong to a certain set of distributions, the fol- 
lowing probabilistic inequality must hold: 


2 жә 
-1 
УСНИ > 

ME ВА а 

ola үш 


г — 
where Q(F; F2) 52, ш. Ti and € is any posi- 
tive numerical value. It should be noted that the 
derivation of the inequality was based on the con- 
ditions that n > r, m > г, and that when ni <1 or 
when mi «1, 1# 1. By use of the above result we 
can then evaluate an upper bound of the probability 
in terms of a pre-assigned risk, с. It is easily 
seen that if Е, and Ег have the same distribution, 


we get 
rê + r-1 Em 2 
JV a (Yn vm 


D(F, Е.) >1- 
Аза numerical illustration, consider the two 


је 


5 at The North Carolina Agricultural and Technical 
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distributions of grades which were received by 
students enrolled in courses in calculus and dif- 
ferential equations at a university. The number 
of students assigned to each of five grades A, B, 
C, D, and F for these two courses were as follows: 


BCD в" род 


іе 


Calculus 12 31 52 57 48 
Differentials 


Equations | 2 20 44 50 44 


The calculus was aprerequisitefor the course 
in differential equations. These two courses were 
composed of not only the same individual students 
but also some twins, broth: 


ers and sisters. Very 
few of these students were, however, transfers, 
It frequently happened in m athematics courses 
that students with a good 
ally made a good grade i 
On the other hand, stude 
calculus were ordinaril 
tory work in differen 
reasons, the two di: 


If we use a risk, a, of .05, say, we get 


Е? /25+5-1/ 1 u) = .5395 . 
205 


У200 ¥160 


Thus Q(F,F;) >1- 


Therefore, the two distributions of grades in 
question belong to the same set of пила, 
Finally, it may be added that if Ше rando 


variables X and У are assumed to be independent, 
we then have 


> 


16(r? +r - 1) 
Ре, 21-60) у1-Мй 22 
1.1 256(r? + г - 1)? 

па 722) + ЕЁ n? m? 


when F, = F, 


А , t 
It canbe readily shown that the previous pem 
(without the assumption of independence) is à 


ter bound of probability when the following inequal- 
ity holds; 


2 +г-1) 
15(п2 + m?) - 4/nm (Vn + Vm)? > ne 


с р Е 
ni 
Calculus Ai 060 2155 260 2285 .240 
iff tial E Р mi 
Differential Equation mi .013 .125 275 312 275 
ni Mj 
за gn 028 „139 
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THE PERCEPTION OF MUSIC SYMBOLS IN 
MUSIC READING BY NORMAL CHILDREN 
AND BY CHILDREN GIFTED MUSICALLY* 


ROBERT G. PETZOLD 
University of Wisconsin 


1. Introduction 


Introductory Statement 
THE COMPLEX patterns of sounds which are 
to certain limitations, 


called music may, subject 
be indicated with а relatively high degree of ac- 
curacy through the use of an equally complex 8у5- 
tem of musical notation. This system utilizes 
three types of Sy m bols: (1) those which indicate 


the pitch of musical sounds; (2) those which indi- 
cate the temporal durationof musical sounds; and 
(3) those which indicate certain related aspects of 
musical interpretation such as tempo, dynamics, 
phrasing, andstyle of performance. Music read- 
ing is a highly complex process because it in- 
volves not only the recognition and interpretation 
of these various symbols, but frequently requires 
à simultaneous response in terms of vocal or in- 
Strumental performance. 

A study of the p rofessional literature shows 
that several terms may be used to identify the 


process of reading music, ОГ to identify a specif- 
ic kind of level of music-r eading activity. The 
ic reading as the pro- 


present study defines mus : 
сез5 of reading and interpreting the various kinds 
of music symbols and converting these symbols 
into sound. This does not imply accurate герго- 

gest that the in- 


eaning of the ѕут- 
a high level of 


correct response. The symbols become aguide 


for action, with the indivi 
value judgments regarding 
кошы се, апа соп5 
Ormance to agree Wi 
M eg educators аг skills 

of music readini 173 
Velopment of the ша, This type of skill 15 


considered an essential element of musical under- 
standing and appreciation, as well as of independ- 
ent musical performance. Consequently, the mu- 
sic program of the school gives considerable em- 
phasis to a variety of activities designed to 
facilitate the development of this skill. However, 
there has been little sys tematic research con- 
cerned with the several aspects of music reading 
although considerable discussion has taken place 


regarding the following: 


1. Adequate identification of the learning pro- 
cess as it applies to music reading. 

2, Development of e ffective procedures for 
teaching music reading. 

3. Designing effective instruments to measure 
and evaluate music reading competence. 

4. Determining reasonable levels of music 
reading competence for children represent- 


ing various grade levels and various degrees 


of musical ability. 


Purpose of the Study 


y seeks to develop experi- 
mental learning 5 ituations which would provide 
significant info rmation regarding the perception 
mbols by children of average musical 
gifted musically at the fourth 
The process of reading 
music appears to depend upon three percep tual 
levels: 1) the au 
sounds; 2) the visual perception of musical sym- 
bols; and 3) the integrative, internalized process 
ich the indi vidual organizes previous 
isual perceptions ofgiven stimuli in 
these same, or similar, stimuli 
new learning situations. Each 


The present stud: 


order to react to 
as they appear in 


th the U. S. Office of Education, De- 


tract wi 
у i rmed pursuant toacon 
The research reported herein was зүлгэн The writer wishes to acknowledge the assistance ана» 
elementary schools; their special asic. 


Operation gi ildren 0 
ation giyen BY Б я: Superinten 


teachers; and Dr. philip Falk, 


Partment of Health, Education, ап : Wisconsin 
adison, isconsin, 
the M f Schools, and 


Mr. LeRoy Klose, Supervisor of Music. 
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of these three Perceptual levels will be explored 
experimentally, comparis Ons will be made in 
terms of the two groups of children at each of the 


two 


This study deals primarily with Teading those 
music symbols which indicate pitch; other kinds 
of symbols are tobe considered in future investi- 
gations. Many music educators recommend that 


JOURNAL OF EXPERIMENTAL EDUCATION 


visual and aural identification and recogni- 
tion of tonal Configurations. It will also es- 
tablish, in part, the basis for sel ecting 
learning tasks appropriate to Phase II of the 
Study and provide an opportunity to study the 

4 techniques appropriate for the collection and 
analysis of the data. 

4. Construction of the test materials for Phase 
П of the study and administration of the test 
to 150 average and musically gifted children 
in both fourth and Sixth grades. Phase II is 
Primarily concerned with Perception as an 
integrative, internalized process through 
Which the individual relates his auditory and 


grade levels, and hypotheses will be tested, 


: response is y d, it is diffi quio 
3. p v significant differences between е subject b 5 Ге безен to Judge и Па 
es oe а tonal Configuration Musical Symbols bei стаи: т у 3 
ur y and € visual perc €ption of these Sponse 1 арена Ms ос pesi ые 
same configurations, i челак 
4. 


Study are brief 
ly outlined below and will be discussed їп detail 
in a later section of this report, 


1. 


in Singing as а Part of the reg 
an 


instrumental response would have 
further restricted the Study to a relatively small 


Analysis of Song materials to identify com- 
mon tonal Configurations which may be used 


as test items. 


Skill in па ош € problem of developing 
. Selection of subjects for the study ang col- discussion ж reading has been a fertile topic for 
lection of certain personal data on each sup. {ба бї vs ше 5. Magie literature for 3 pd 
| 5 Sic о 
јест. я а ум шапу Suggestions and 1С educators have 
‚ Construction of an initial test апа adminis- 


erenc 
tration of this test to approx ote 


Subjects selected at random fr tives of 1h Машу Specialized skill, The: обеси 
i Stated b mus 10 program in the schools, as 

У authorities in the field, have always 
1964 for ас quiring а certain level 
ng skill. There exists, however, а 
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considerable amount of uncertainty regarding ex- 
actly why children need to learn how to read mu- 
sic, as well аз арепега! tendency to avoid making 
specific recommendations regarding effective and 
adequate procedures which might be followed. 
One valid reason for the vagueness with which 
these two points are discussed may well be due to 
the lack of adequate research information. 

Theliterature concerned with the topic of mu- 
sic reading may be grouped into three broad cate- 
gories: 


1. The historical background of musical nota- 
tion and of syllabic devices for interpreting 
musical notation. 

2. Professional literature which includes 
courses of study, specialized music educa- 
tion methods books, and the many articles 
in professional jo urnals which report the 
Opinions, experiences, and recommenda- 
tions based upon personal experience. 

3. Research literature. 


For the sake of brevity, the writer will present 
Only a partial summary of certain pertinent re- 
Search literature. " 
Analysis of the research in the spec ific area 

Of music reading reveals the following pertinent 
information. Onegroupof studies has approached 
the problem through a detailed analysis of the eye 
movements of persons engaged in reading music. 
Jacobson (4) followingthe procedures of Gray and 
Buswell in the field of language reading, carried 
forward a detailed photographic analysis of eye 
Movements as a means of identifying the various 
Stages in the development of music reading abil- 
Ну. The section of the report which is pertinent 
to the present study deals with the reading of vo- 
Cal music by three college students and three 
sixth-grade pupils. A detailed analysis of the per- 
formance of each subject is presented and the re- 
Sults are briefly summarized here. The less ын. 
ture reader required pauses almost twice as grea 
as the mature reader but the number of pauses 
did not differ significantly. The span of recogni- 
tion, which included both words and notes; seldom. 
exceeded two notes for the mature reader an 
Was smaller than the span of the earlier шаи 
Mental reading. The smaller intervals were те 
Correctly more often than longer intervals, dcus 
tonic sol-fa syllables re sulted in longer ая 

ап when words were used; and sol-fa ву! аш 
Were found to be of no benefit to the ас ын т” с 
author offers several suggested E p ni bal 
the results of this study to classroom instr ТОРТ 
Which seem to indicate that new and га Зэн 
reading materials are needed to develop 
mature reader. 

Ortmann (7) summarize 


ons by pointing out that the 8 
&reat deal upon the placemen 


iga- 
of his invest 
à a of vision depends 


tand pattern opie 


notes. He found the field of clear vision to be a 
circle varying between 1/4 and 1/3 inch in diam- 
eter with no difficulty encountered in reacting to 
notes within this area. The chief determinants of 
difficulty in note reading, not involving actual eye 


` movements, are: a) the number of notes in Ше field 


of clear vision, b) the area coveredby the distri- 
bution of the notes, c) the number of linear di- 
mensions involved (1.е., combinations of vertical 
horizontal, or oblique), d) the complexity ог sym- 
metry of the note pattern, and e) the meaning of 
the note group as a melodic and/or harmonic pat- 
tern. He points out that difficulties in note read- 
ing, for the normal pupil, result from an inability 
to group a distribution of notes into larger percep- 
tual units. Familiarity with musical notation is 
quite obviously a significant factor in accuracy of 
reading. 

A third study of eye movements іп music read- 
ing was carried on by Weaver (10) using trained 
musicians, withthe piano as the performance me- 
dium. His experimental reading materials were 
harmonic, melodic, and single melody with sup- 
porting chords. Therhy thmic element was re- 
tained in all types as a means of accurately meas- 
uring the number and duration of reading pauses. 
The results indicate that note reading relates 
more closely to reading algebraic formulae than . 
to language reading, assumingthat one mus ical 
note is equivalent to one word. The average num- 
ber of notes played per reading pause varied be- 
tween one and two, with spacing of notes and har- 
monic grouping being important factors. The du- 
ration of reading pauses depended upon the type of 
musical material being read, with those for har- 
monic and melodic material reliably shorter than 
those required in reading an accompanied melody. 
A higher correlation exists between reading time 
and pause duration than between reading time and 
the number of pauses. Weaver also found that per- 
formance in his study providedlittle basis for dis- 
tinguishing types of readers because of the individ- 
ual differences in performing the different selec- 
tions. Itis significant thathe identified the average 
perceptual span as varying between three and five 
notes. À companion study by Weaver and Van 
Nuys (11) was particularly concerned with the re- 
lationships between the rhythmic and melodic ele- 
ments of music as they affected occular and man- 
ualbehavior. The immediate m emory span Was 
used as а criterion of difficulty in determining the 
eye-hand span, with immediate memory span de- 
fined as the am ount of material which could be 
visually explored, perceptually organized and re- 
tained until played. The results showed that mem- 
ory span decreases as the complexity of note re- 
lations increases for bo th rhythm and melody. 
The memory span for a combination of a simple 
rhythm and m el ody is only slightly smaller than 
the span for each alone. In more complex combi- 


nations, the melodic element seems to prove 
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more difficult than the rhythmic. Melodic factors 
limit memory span whenever the melody is not го 
tremely simple; rhythmic factors limit the rate о! 
reading when complex rhythms are used. If mu- 
sic reading skill is to be developed 50 more гар- 
id responses are made, individuals need to im- 
prove in the ability to apprehend pitch patterns as 
stable melodic segments and to grasp rhythms as 
ic figures. 
он to determine the complexity of 
the musical pattern which could be perceived at 
one fixation of the eyes by individuals with vary- 
ing amounts of training. He also studied the ef- 
fects of tachistoscopic training on the Span of per- 
ception. The investigation showed that the ability 
to grasp a musical pattern of three or more notes 


determined by the 
ion of notes on the 
upings, and the ir- 


ing of typical music. Stokes (9) investigated the 
effect of tachistoscopic training on reading me- 
lodic material. The resul ts, however, showed 
that the gains on a post-test could not be consid- 
ered due to the influence of the Specific training 
program alone. Aninteres ting instrument for 
testing the sight-reading performance of instru- 
mental performers was constructed by Stokes and 
contains several valuable suggestions. The test- 
ing results show, however, that tach is toscopic 
training had no effect upon general achievement 
in reading music and that there was little evidence 
of carry-over. 
Smiley (8) investigated the music reading ‹ 
саршагу” in basic music texts 
grade level and found the total nu 
configurations to be far too many for the child to 
respond to readily. The 351 configurations could, 
on the basis of frequency of use, be reduced to 
approximately 20, each contain ing from two to 
five notes. Further analysis led Smiley to con- 
clude that it might be more valuable for the child 
to learn a basic list of two-note Configurations of 
seconds and thirds. The procedures Suggested by 
Smiley for identifying tonal configurations were a 
valuable guide for the analysis made by the 
writer. 


“уо- 
at the fourth- 
mber of tonal 


This reported research in music readi ng has 
been primarily concerned with certain aspects of 
visual perception or of physical behavior during 
the process of reading music. Such research has 
not yet provided an adequate basis for prescribing 
instructional procedures to develop musical con- 
cepts which may be readily utilized in a total mu- 
Sical setting. Apart from a few reports of actual 
classroom experimentation in the problem area, 
there is little information regarding the ways in 
which children of elementary school age react to 
musical notation because the majority of the 
Studies have used older s ubjects. The survey of 
literature again re-emphasizes the need for con- 
tinued, systematic investigation of this, and other 
problems in music education. 


Ш. Procedures of the Study 


A. Analysis of Songs 
12278515 oi songs 


The initial step taken inthe study was a detailed 
analysis of 326 Songs selected at random from 
eighteen of the basic song-texts used in grades 
three through six. Since the subjects for the study 
were tobe selected from the fourth, fifth, and 


sixth grades, it was necessary to use test items 
with which th 


wellas items 


the frequency of appearanc 
figurations in songs. 

The procedure 
Configurations we 
ley (8). Other 
teachers’ manuals to the basic 


approach to music agin? 
re setup toserveas a guide 
iguration: 


1. Thirt 
of random 


Songs containj 
Passages, 


2. All types of Songs were included in the anal- 


ysis, no distinction being made between specific 
Reading Songs, rote Songs, or rote-note songs. 
3. Single intervals were generally not consid- 
ered as tonal Configurations since they donot im- 
ic flow or movement. 
4. The interval between musical phrases was 
not usually included, 


5. The interval within aphrase which appeared 
between the end of One distinct configuration ап 
the beginni 


ПЕ Of another configuration was not in- 


Y 
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cluded unless the text showed that such an inter- 
val occurred in the middle of a word. 
Sè 6. The interval between readily identifiable 
г quences was not included, each sequence being 
reated as a separate configuration. 

7. Repetitions of single notes were considered 
z single notes; do-do-re-re-do-do- was analyzed 
= do-re-do. The decision to exclude repeated 
Ones from c onfigurations was made in order to 
Simplify the analysis even though such repetitions 
occur frequently in songs. 
ar 8. In part songs, each part was analyzed sep- 
d ately only in those sections of the song where 

e soprano and alto parts Were different. 

9. In songs containing repetitions, only the new 
ending was analyzed, not the entire repetition. 
10. Frequently the rhy thmic structure within 
the measure or phrase could be used to assist in 
s identificationof a complete tonal configura- 
n. 

211. The text of ће song itself, particularly 
With reference to words of more than one sylla- 
ble, could frequently be depended upon to be of 
jBistence in accurately identifying the tonal con- 
igurations. 
in 12. Within these guides, 
b Е the lengthof the configurations had to be made 
E the investigator in term s of what the child 
саша be expected to grasp. As a result, tonal 
+ nfigurations in excess of five or six tones were 

arely identified as such. 


the decision regard- 


merical res ults of 
г of different tonal 


Маз 4575, with a frequency range of 

Significant that 47 tonal configurations, OF only 

-42 percent of the total number identified, with 

„гедпепсу range of 20 to 213, had а combined 
ое зепсу count of 2585, or 56. 

frequency count. Even m i 

€ fact that 22 configurations, OT not quite four 


Percent of the total, with a frequency range of 


ра, had а combined frequency of 
сг percent of the total frequency С 
ДС igurations with a frequency range A 
Counted for 460, or 82.44 P t of the to 
Umber of configurations, but nad а combined fre- 


duenc ent of the 
t y count of only 1327, ОГ 29 perc 
otal frequency pen The data points UP the fact 


there exi ively 501 5 
t exists a relatively 
fhe configurations which appear ге peatedly in 
Songs children sing. PS 
Well ad songs in a minor mode been ne на 
tory 15 Songs containing chromatic ап ща d. 
atio Passages, the total number of tonal со ig 
558 15 would have been considerably n 
the f The situation is still further COME. 
се fact that each one of these 58 confi 
be written in at least nine differ 


keys commonly used in these song-texts. Since 
each configuration has, therefore, nine different 
notational systems within which it could appear 
the total. possible number of 5022 configurations 
appears to present a very formidable problem to 
the music educator faced withthe task of identify- 
ing effective methods for developing skill in music 
reading. Fortunately, the general outline or shape 
of a configuration remains constant from one ton- 
ality to another despite differences in actual nota- 
tion. However, this p roblem of perceiving and 
identifying configurations when they are presented 
in several different tonalities needs careful inves- 
tigation since it is frequently di smissed as being 
relatively unimportant. 

Since the selection of testitems was tobe based, 
in part, upon the frequency of appearance in songs, 
the twenty-two most frequently used configurations 
are listed in Table Il. For purposes of compari- 
son, the twenty-two most frequently used config- 
urations identified in the analysis made by Smiley 
are also presented. These twenty-two configura- 
tions represented 3.94 percent of the total number 
of configurations identified by the writer and 6.1 


· percent of the total number of configurations ident- 


ified by Smiley. It is significant to note that 
although Smiley confined his analysis to four th- 
grade songs identified as reading songs, his find- 
ings clos ely agree with those of the present 
investigator. 
mon to both 

three configurations not fo 

ion list. These с onfigurations are identified by 
an asterisk. It would be reasonable 


that analyses 9 , 
vestigators operating in 
ing similar criteria 0 
comparable results. 
In order to insure an adequate number of tonal 
configurations from which testitems could be con- 
structed, an additional 25 configurations were list- 
ed. These configurations, with a frequency range 
tedfor 646, or 14.12 percent 
ount and аге listed in Table 
ant to note that there again exists 
petween the analysis made 
Џеу and t at made by the writer. Sixteen, 
xi p of these 25 configurations ар pear on both 
lists. The 47 tonal configurations, with a fre- 
0 to 213, which have been ident- 
, included 35 configurations 
jmilar frequency range as identified by 
currence of identi- 
tof the highest ranking items. 
ompare results of this 
onfigurations identified in 
1 and teacher’s guides to the 
professional books ob kie 
this particular study. 


SA to | 
ен 41 configurations were considered а вш- 
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TABLE I 


THE 22 MOST FREQUENTLY USED TONAL CONFIGURATIONS AS IDENTIFIED IN 
THE PRESENT STUDY AND BY SMILEY 


Smiley’s Study 


Present Study 


% Total 


% Total 
Configuration Frequency Frequency 


Configuration Frequency Frequency 


зо do so do 191 041 
mi re до 167 .037 | solaso 135 ‚029 
вола во 136 .030 | doso 102 ‚022 
mi fa so 115 . 025 mi ге do 99 .021 
do re mi 110 .024 so mi 91 .020 
do tila so 106 .023 | miso 88 .019 
do mi so 93 .020 do ti la so 80 2017 
80 ші до 93 ‚020 do ti do 13 . 016 
So fa mi 92 . 020 до ге do* 72 . 016 
80 mi 90 .020 | so mi до 70 ‚015 
mi so 76 ‚011 do mi so 69 . 015 
SO la ti do 76 ‚017 fami re до 69 +015 
do la so 69 .015 | doremi 66 .014 
“9 "m 68 .015 | mido 65 2014 
do ti do 65 .014 so fa mi 64 ‚014 
80 fa mi ге do 60 „013 do 1а so 60 ‚013 
80 mi so* 58 Ju | ан i: iis: 
mi do 55 .012 mi fa 50 58 .013 
2 а mi re do 58 . 013 
= = 4 E bi * 53 ‚011 
fa mi ге do 54 . 012 so la a 
* 4 
80 do mi* 44 sore | 918 e 
Te mi fa 44 .010 do mi : 
369 


Total 


4 і і ion list. . 047 is 4.7 percent, etc. 
Configurations not found in compan interpreted as follows: «0871 р : 
e 


The figures in this column ar 
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TABLE Ш 


FREQUENCY ARRANGEMENT OF 25 TONAL CONFIGURATIONS HAVING A 


FREQUENCY COUNT OF 20 TO 39 


Configuration Frequency 


Configuration Frequency 


do re mi fa so* 39 so mi* n 
do re do 36 so do* + 
до re mi do* 31 50 fa re do 23 
mi fa mi* 30 fa re* 23 
fa mi re* 30 la ti do* 23 
do do 29 do la do 22 
do ti la* 28 mi so do 22 
re mi re do* 27 SO la so fa mi 22 
do mi do* 27 la so fa mi* 21 
mi re mi* 27 50 do so М 20 
до ге mi ге дож 26 ге fa mi ге до ' А 20 
do so do 26 So fa mi re* _20 
mi so mi 25 Total 616 


*These items were also id 
range of 20 to 39. 


Sntified by Smiley as having a frequency count within this 


TABLE Iv 


6th Grade 
School Boys Girls Boys Giri E Gl v. 
School B 4 5 4 5 А а 8 
School C 5 6 5 а я : 5 
Total 14 16 та 4 И 
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ficient number from which to select test items 
and any further analysis of the data was unneces- 
sary at this time. The analysis was of consider- 
able value in emphasizing a basic problem which 
the writer feels needs clarification if music edu- 
cators are to continue to support the “раНегп” 
approach to music reading. Itinvolves establish- 
ing objective, rather than subjective criteria for 
accurately identifying tonal configurations. Such 
objective criteria might well be established if the 
procedures suggested in the research concerned 
with the photographing of eye-movements were to 
be applied to the problem. Certainly such an in- 
vestigation would also need to consider the ways 
in which the perception of any given tonal config- 
uration might be si gnificantly influenced by the 
factors of rhythmic structure, the accompanying 
text of a song line, and the overall movement and 
direction of the melodic line. 


The subjects for this study were fourth-, fifth- 
and sixth-grade pupils selected from three of the 
twenty-two public elementary schools in Madison, 
Wisconsin. Limitations of time made it impos- 
sible to derive a sample which. would be repre- 
Sentative of the total population of these three 
&rades, and it was, therefore; decided to select 
the subjects from three schools, these schools 
being representative о: 
The criteria for selection were school size, geo- 
graphic location and socio-economic background 
Of the children. School A is alarge school, with 
an enrollment of 841, located in the east area of 
the city where pupils are drawn largely from the 
Semi-professional group. School B is a small 
School, with an enrollment of 478, located in the 


Central ‘ty where pupils came from 
area of the city p School C is 


the skilled and semi-skilled grouP. de 
9f medium size, enrolling 660 pupils, and loca E 
s the west section of the city in an area largely 
9mposed of the professional group. i- 
иа Sample for Phase 1—Тһе study 18 divided 
о two distinct parts, еас Ч 
Sample. Phase I utilized а “normal” population 
s Pupils Selected at ran 
Ч th and sixth grades of еас | 
differentiated in terms of musi 
Nu were secured and an еди 
тыйа each grade in each SC 
ша, with equal number 
pr ough more than 100 child 
ase І, usable data was sect 


Table IV shows the distribution by 86 


Егаде. 

us The data for several subjects bie А 

of able because they were unable to та: 

9f response called for by the test, 2 r was con- 

enue to aural and visual stimuli. 
dered unwise to administer the ent! 


h of three schools and 
cal ability. Class 


f the overall school system. 


children for whom there could be little success 
on any item. 

2. Sample for Phase III—Phase П of the study 
utilized two reasonably well-defined segments of 
this “погша!” population of the three selected 
schools, pupils identified as those gifted musical- 
ly and those possessing average musical ability. 
Subjects were selected from grades four and six 
for each of these two groups. The initial step 
taken in the identification ofthese two groups Was 
to establish criteria which could be followed. 
There is considerable discussion in professional 
literature regarding the giftedchild, but there ex- 
ist few criteria for identification in the field of 
music. Usually such judgments are made on the 
basis of one or more of the following: a) excel- 
lence in musical per formance; b) demonstrated 
knowledge of musical information and concepts as 
measuredby achievement tests; c) results of mu- 
sic aptitude or talent tests; d) grades in music 
subjects; and e) rating scales constructed for the 


purpose. 
The schools participating in this study 


do not follow à standard testing program to estab- 
lish levels of musical achievement and there was 
no readily available data which could be used for 
identifying possible gifted children. А prelimi- 
nary investigation of available professional liter- 
ature had failed to yield many concrete sugges- 
tions regarding procedures which could be used 
to identify musically gifted children other than the 
criteria which have already been mentioned. Each 
nced special music teacher, 
his person would be capable 
of identifying gifted pupils if certain criteria were 

ilable. Consequently, arating sheet was 
designed by the write 


considered by music educ: ә 
tors in determining mus? cal accomplishment of 


elementary school pupils. (See next page for rat- 
ing scale.) The rating scale was submitted to a 
group of music teachers for comments and sug- 
gestions and a final form was drafted. The mu- 
sic specialist їп each 5 chool was then asked to 
evaluate all fourth- and s ixth-grade pupils, the 
rating sheets were then scored and tabulated. 
Obtaining evaluations for all pupils made it pos- 
sible to identify for each grade in each school, 
three groups of pupils: a) those receiving the high- 
est scores; b) those receiving the lowest scores, 
and с) those in the middle of the distribution. This 

rocedure led to a more accurate identification of 
the ‘average’ group since it could be assumed 
that they were to be found be tween the upper and 
lower groups. For each grade in each school the 
upper ten percent of the total distribution was ten- 
tatively selected as being musically gifted. The 
mean rating score and standard deviation was cal- 
culated for each grade in each school and the “ам- 
erage' group was tentatively defined as being with- 
in the limits of one-half a standard deviation on 
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PUPIL RATING SCALE 


Name | (Sehool) — —"(Grade) _ 
Hacc ет 
i i ili i i s of 
Instructions: Please check the response which best describes the pupil’s ability or attitude in term 
(а performance on each item.) 


A. GENERAL SINGING ABILITY 


1. Тһе pupil is able to match ind: 
Always 

Usually 

Occasionally 

Seldom 

Never 


ividual tones and intervals: 


| 


2. The pupil is able to Sing a phrase of а song he knows: 


But never Succeeds, even with help from the piano or teacher 
Seldom Succeeds, even with help 


Needs frequent help 
With only occasional help 
Independently, with no help 


the complete song, even with help 
Is never able to Sing the complete song, even with help 


6. Тһе pupil’s Singing, either alone or with the class. ; i iasti ager 
whole-hearted and controlled response: аб ЧЕ ере, + 
Almost never (0-20% of the time) 
Occasionally (21-40% of the time) 
Frequently (41-60% of the time) 
Usually (61-80% of the time) 
__ Always (81-1009, of the time) 


B. RHYTHMIC RESPONSIVENESS 


7. The pupil, when asked to identify the un : 
pes poe and accurately: 5 derlying beat or meter of the 
Always 
Usually 
Occasionally 
Seldom 
Never 


s 50 
Song by hearing it, 4068 


эж 


---- 
ص 
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. The i wi immediate! 
pupil, when asked to repeat common rhythmic patterns hich he hears, does im d 
8 , i y 


and accurately: 
Never 
Seldom 
Occasionally 
Usually 
Always 


C. MUSICAL KNOWLEDGES AND SKILLS 


9. When participating i i i i 

pating in appropriate music reading activities, th i 
A leader, capable of much independent pesa reading цалин ан 
Partially independent, voice 


Occasionally able to control th 
Seldom is able to react to the notation, learns by rote mo: 


Seems to be unaware of the meaning of the notation, canno 


another 
10. The pupil demonstrates а knowledge of common musical symbols and terms used in his grade 


that is: 

Poor 
Fair 
Average 
Good 
Superior 


D. 
GENERAL INTEREST IN MUSICAL ACTIVITIES 
11. In terms of the singing activities carried on, the pupil is: 
i ted and responsive 
i ested and fairly responsive 
but does not resist 


Passive, does not put forth much effort 
Slightly negative, frequently does not participate at all 
Has a strong negative reaction, usually does not participate 


12. In terms of the listening and creative activities carried on, the pupil is: 
Strongly negative, ils to pay attention or show interest 
Slightly negative, frequently does not show interest. 

Passive, sho i a negative ог positive reaction 

Is usually Ver onsive 

Very intereste 


г each sixth grade is slightly 


кол side of the mean. The ratings for each The mean score fo : 
пој E were kept separate since it obviously coul biu ас н Т гер er fourth grade, 
ёр assumed that all three teachers would іп- but it ` not conside ssary to test for the 
Pret the criteria in the same way: significance of the difference at this time. For 
e In addition to the teacher atings, а second cri- the combined grades in the three schools, with a 
mu measure was used to identify the musically total N of 438, the range of scores was 13 to 40 
ted. TheKwalwasser Music Talent T st, Form and the mean was 28. 2 | 
pu Was administered to all fourth” and si -grade A comparison of the data obtained from the 
Чрїїз in the thr hools. The decision to ad- teacher ratings and the Kwalwasser test showed 
inister this 2 „7 je ав pos п the that a rela tively low correlation existed between 
Practical n e dec тээн 4 standardized talent the two criterion measures. Combining all pupils 
езі which could c iy en within à single twenty in both grades in each school and calculating the 
ae minute oe Е : нар ан ӨГӨН звіс Р стой coefficient of pee for each school showed 
he schools. Th Е administered to 438 the following: a) School A, г = .34; b) School B, г 
. The test w narized in Table V 2.38; c) School С, г=.34. In the absence of 


Pupi 
ils and the data is summ 
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TABLE У 


SUMMARY OF THE SCORES EARNED ON THE KWALWASSER TEST BY 438 
4TH AND 6TH GRADE PUPILS IN THREE SCHOOLS* 


4th Grade 
ڪڪ‎ Ц 


6th Grade 
Score 


а. 
Score 

School Number Range Mean Number Range Mean 

A 94 19-36 26. 88 108 13-37 28.73 

B 70 19-35 27.50 50 21-40 28.58 

С 54 20-36 28.37 62 17-37 29.66 


*The highest Possible score is 40. 


E OF PUPILS INITIALLY g 
THE TES М OF PHASE П 


Total Percent 4th Grade 
School Subjects 018 


10 21 15 

в 38 25 15 7 8 в 

А ч = н 10 13 10 
Total 151 100 49 27 


42 33 


————————— 4 
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other information, one may conclude either that 
teacher ratings provide а more valid way of deter- 
mining musical giftedness than do talent tests, ог 
that the reverse is a more appropriate conclusion. 
The writer decided to consider this aspect of the 
problem in more detail after the data had been col- 
lected for Phases I and П. Judgments regarding 
those pupils who were to be classified as musical- 
ly gifted were made on the basis of arbitrarily 
assigning equal weight to both the teacher ratings 
and the Kwalwasser test scores and then selecting 
Pupils who ranked highest on both measures. 

The sample for Phase II was selected on the 
basis of proportionate representation in terms of 
each school and each grade. The data showeda 
total of 475 fourth- and sixth-grade pupils in the 
three schools with School A accounting for 44 per- 
Cent of the total; School B for 25 perc ent of the 
total; and School C for 31 percent of the total. 
These proportions were maintained in the sample 
and, for each school, the proportionate number of 
Pupils enrolled in the fourth and sixth grade was 
also maintained. An arbitrary total of 151 sub- 
jects was established as the maximum number 
that could be tested within the time limits of the 
Study. The number of gifted pupils identified by 
fro two criterion measures was then subtracted 

rom the total number allocated to each grade in 
each school and the remainder constituted the 
number of average pupils. These were to be se- 


lected at random from the total distribution with- 
dard deviation on 


al subjects were 

te availability of al- 
rnates in the event a selected subject was un- 
le to participate in the testing program. Table 

ini Shows the number of average and gifted pupils 
itially selected for Phase П of the study. 


c : 
- Testing Procedures: Phase 1 


Several e ing sess 

i xploratory testing ” 

ns valuable information necessary for € 
Shing the testing procedures to be followed in 


dy. T i iscussed in the report 
hese will not be dis apap а 


ut аге m + а 
entioned so the rea А 
мема procedural decisions which were pue 
noe based upon actual test 8 an 
е arbitrary decisions of A ще 
е origi tained twen 
Vary; ginal test con 
arying length and complexity, selected gs 
in 55 04 the analysis of son als discusso 
tee ША. It was foun е trt 
of p 8t took too much time, апа: 
за test was reduced to three тА Е, 
item assumed that sever tri nificance 
than s Would provide data of greater 51811! 


ions had 


Шш i i different items- 
ould a single trial of many ams were 


Thr 

Са i ix-nO 

Used „2 four-, five-, and six ding scale 
i i ending 

and cnePresenting ascending and x ^ i rately 


Ord patterns, either PF es © 


or in combinations. The construction of these test 
items utilized some of the suggestions of Ortmann, 
who was interested in defining and measuring tonal 
memory in music (5, 6). The items were written 
as quarter notes with no measure bars or time 
signature indicated. All items were written in the 
same key to reduce the number of variables in the 
testing situation, with the key of Eþ major consid- 
ered to be most appropriate since it would keep 
the items within a reasonable singing range for the 
children. Three complete sets of 4 by 6 test cards 
were used with the notation on the cards duplicat- 
ing, as clos ely as possible, the printed notation 
in books in order to make for easier reading. The 
staff contained lines spaced 1/8 inch apart which 
were placed in the center of the card, the length of 
the staff being varied according to the number of 
notes in the test item. The note heads were oval 
shaped with a lengthwise diameter of 1/8 inch and 
were spaced 3/8 inch apart. The treble clef sign, 
flats, and quarter note stems were also of suit- 
able size. No subject reported difficulty in read- 
ing the notation, soitmay be assumed that the vis- 
ual stimuli were satisfactorily notated. One trial 
of this test is shown in Figure 1 at the top of the 


next page. 


А second problem was concerned with the man- 


ner in which the tonality and starting pitch was to 
be established for each configuration. Three 
methods were considered: a) to sound the pitch of 
the starting note of each configuration; b) to sound 
the tonic cho rd of E flat major, followed by the 
pitch of the starting tone; сн кес | дш 
and le 
key-tone E flat, or do, oy ee то 


starting tone of Тһе configura 
nA P shows that each of these methods 


in similar testing situations, with 

ma mid В ри considered to be the most dis- 
criminating in terms of reading ability. This calls 
for knowledge of not only basic music notation but 
requires the subject to establish the tonal relation- 
ships internally before si nging. The exploratory 
testing sessions established the first method as 
was concerned with the method 
heitems in the testing situation. 
ation of the exposure time, while not а 
саш point іп the 5 tudy, needed to be uniform 
to individual if the data were to be 

A memory drum with a 
was used initially but 
to be too brief for 


r preted. 
ure time 

sults showed the exposure 
the ге reading response. Therefore, a tape 
f the entire test was made, synchro- 


recording 0 ems which had been 


ith the individual test it 


өрөг: cards. The piano was used in prefer- 
є to some other medium because the children 
enc iliar with its sound. 


ing of the tape was accomplished by set- 


The tim ( 
Десігіс ше 


ting ап ere! 
second, with the flas 


4 JOURNAL OF EXPERIMENTAL EDUCATION 
28 


FIGURE 1 


THE 10-ITEM TEST FOR PHASE I 


than the clicks to keep the tape free of unneces- 
sary sounds. 
The examiner gave the verbal cue for h 
the stimulus card to the sub 
—” within a two-second interval. The 
Starting pitch was Sounded 
Sustained for two seconds. 


anding 


— 
pitch), ” the “ж Occurring each 5 есопа 
varying time intervals are Shown below: 


Length of Duration for 1st Duration of 2nd 
Item Interval of Time Interval 
Silence 
„А 
3 notes 8 зесопаз 5 seconds 
4 notes 9 seconds 6 seconds 
5 notes 10 seconds 7 seconds 
6 notes 11 seconds 8 seconds 


The subjects for Phase I of thestud 


У Were se- 
lected at random from grades four, fi 


utilized the regular test procedures and gave the 
Subject an opportunity to ask questions about апу_ 
thing that was not understood. Considerable em- 
phasis was given to the idea that hearing an 
item after the subject had tried to read it was де- 
signed to make it possible for the Subject to ге. 
call the sound of the item the next time it was 


а second tape rec order ata tape speed of ме 
inches per second, a procedure which insured ^ 
accuracy of processing and scoring the responses. 


The testing procedu res for Phase I may be 
Summarized as follows: 


l. The subject was given a visual aai 
of the first item, accompanied by the: star 
ing pitch of the group. se 

2. The subject then Sang an immediate respon 5 

to this visual presentation, reading the note 

of the Configuration. d 

This reading trial was followed by a SECO 

Visual presentation of the item in com ni 

tion with an aural pre sentation of the ваше 

item. This permitted the subject to associ 

ate the sound of the item with its visual а) 

resentation as well as to provide a mode 

against which he could compare his origin 
attempt. : 

4. Immediately after this aural presentation, 
the subject sang what was heard, continuing 
to look at the notation of the item. : 

5. The subject then proceeded to the next uere 
following Steps 1, 2, 3, and 4. The testi 8 
continued in this manner until all the item? 
had been presented, с- 

8. The total list of items was presented a хи 
ond and third time, with the items жамалы, 
ized for each Successive presentation 


eliminate the ров sibility of rote or seri 
learning, 


PE t 
Processing the data presented no significan 


es 
Problems, The tapes contained all the respons я 
of the Subjects 
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ing system used was the number of items for which 
correct responses were made in two of the three 


trials. 


D. Testing Procedures: Phase II 


Twokinds of learning situations Were COn- 
structed which differed insequence of tasks rath- 
er than in content. These will be designated as 
Sequence A and Sequence B. Each sequence con- 
tained two tasks and the subject was given арргох- 
imately 15 minutes to complete eachtask. In both 
sequences the two tasks were separated by a 48- 
hour time interval. This time interval between 
tasks was used for tworeasons: a) the results of 
the testing in Phase I indicated that children of 
this age group were not able to give concentrated 
attention to tasks of this nature for sessions last- 
ing much longer than 15 minutes; and b) the test- 
ing was to be done during the time the child met 
with the regular musicclass, these classes being 
scheduled on alternate days rather than daily. 

These two sequences, together with their tasks 


may be diagrammed as follows: 


Task I 


5 to learn five 
n eight trials 
dures estab- 


The subject attempt: 
tonal configurations i 
following the proce 
lished in Phase I. 


Sequence A 


t attempts to learn the 


The sub ёс 
ted from the 


short song construc 
five test items. 


Sequence B 


The material used inboth sequences was the same 


and the testing procedures followed in each task 
Were similar for both sequences. The major dif- 
erence between the sequences Was the order in 
Which the tasks were presented to the subject. 


Persons familiar with the procedur es recom- 
ic reading skill will 


Mended i ; 
n developing music ч 
j COgnize that Sequence Ais pased on the assump 
хоп that preliminary learning of certain tonal con- 
earning of a 


figurati id 1 

ions will ult in more гар! 

га ations. Sequence B 
eading 


Е that music г E 
Skill should begin with the song and then proce? 


to the dri 
4 drill work on the mu с 
std in the song. The original design of the 
y ~ | included а s ec ond short song р 
: Subjects in both sequences w! іс д 
dy adea much valuable data regarding the rela 
the, Merits of each sequence. 
tecre Was not sufficient time to 
esting, 
а pjects 
E S hoped that most 54016 
г learn the test material in 2 single tes 
Оп. Since it could be assumed that mo 


would be able 
ting 5257 
ге trials 


on fewer items would result in a higher percent- 
age of success than fewer trials on more items, 
it was decided to use a test of five tonal configura- 
tions. Item analysis of the data on the visual re- 
sponses to the items in Phase I resulted in the se- 
lection of the five items shown in Figure 2 (top of 
next page). The data for this item analysis will 
not be presented in the report since it is not con- 
sidered pertinent. The techniques employed in 
making the item analysis were suggested by Flan- 
agan (3). 

Since the tonal configurations were again to be 
presented as quarter notes without time signatures 
or measure bars, the song to ре used should not 
contain significantly different notations. It was 
decided to write a song rather than attempt to find 
-texts which c ould serve the purpose. 


one in song 
d in constructing the song: 


These guides were use 


1. It should be reasonably musical without du- 
plicating any familiar songs. 

2. Two phrases, each containing four measures, 
was considered an ideal length. 

3. No material other than the five tonal config- 


Task II 


The subject attempts to learnashort 
song constructed from the five sepa- 
rate items presented in Task I. 


The subject attempts to learn the five 
tonal configurations which had been 
presented in the song. 


eady selected was to be includ- 
ever, permit the rep- 
tonal configuration or 


urations alr 
ed. This would, how 
etition of at least one 


of single tones. 
4. Notation Was to use only quarter and half 
notes, with each tonal configuration appear- 


ing as a single measure whenever rhythm- 
ically possible. 
5. The song would be inthe same key as the 


tonal configurations and would encompass а 


reasonable singing range. 


were constructed and sub- 
ition students, 


theory teachers and public school music teachers 
for reactions and comments. There was consid- 
erable agreement that two of the ten songs were 
musically more interesting than the others and the 

lected by tossing а coin. This 


test song was thense : 
song, with the tonal configurations identified, is 


resented in Figure 3. 
The subjects for Phase П of the study were se- 


lected as described earlier in the report. Each 
of the three schools had four groups of subjects: 
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FIGURE 2 


THE 5-ITEM TEST FOR PHASE П 


Шен нын 


FIGURE 3 


THE SONG FOR PHASE II 


a) fourth grade, average; b) fourth grade, gifted; 
» average; and d) sixth grade, gift- 


quences gave eight 
wn in Table VII, 


ndividually, as in 
Phase I, and the testing consisted of two 15-minute 


sessions separated by a 48-hour time interval, 
Before the testing Started, each Subject was given 
а brief orientation comparable to the orientation 
used inPhasel. Subjects in Sequence A were told 
that during the first session they would try to 
learn to read five tone groups, Or “musical 
words, " and that in two days they would try to 
learn to read a song which used the same tone 
groups. Subjects in Sequence B were 1014 that 
during the first session they would try to learn to 
read a song and that two days later they were to 
learn to read five tone groups, Or ‘musical 
words'' which had been used inthe Song. Theori- 
entation also included a short practice 5 
reading either four tonal configurations or two 
measures of a song, depending upon the sequence 
the subject was following. 

The procedures for the two tasks were the 
same regardless of the Sequence being followed, 
The process forlearning thefive items was ident- 
ical with the process followed in Phase I and de- 
scribed in detail earlier. There were eight trials 
of the five items, with the first trial considered 
as a pre-test, and the items randomized for each 
successive presentation. Learning the song fol- 
lowed the procedures outlined below: 


ession in 


1. The subject, with the notation of the Song 


before him is given the pitch of the өг 

note of the song and asked to sing as pei 

of it as possible. This Served as a pre- the 

and was used as a basis for determining ing 
amount of learning which took place duri 

4 the testing. hav- 

2. This trial performance was followed by He 

ing the subject listen to a performance of „= 

+ entire song, played at a гаје of two апага. 

notes рег second. During the listening íi 

Subject was encouraged to follow the natane 

and, if desired, to sing along with the алы 

3. Following this 115 tening, the subject w^. 

asked to sing through the song a second ben 

4. Steps 1 and 2 were repeated until the n 

had had an oppo rtunity to complete at le e 

eight trials of the song, a trial being defin 


to include both the Singing and listening 257 
pects. 


The reader 
that a process 
place rather thana 
notation of th 
actually осе 5 
fourth Subjec 3 


os 
to sing farther than that point; and ши 


5 ther 
Jects were, OWever, ableto proceed much ate 
When using the notation and so it appears г 

able to ass 


4s resulting from the testing. 


Н n the 
ne Significant difference existed betwee 


A 


School A 
School B 


School C 


Totals 


Average Gifted Average Gifted ‘Average Gifted Average Gifted 
10 5 10 5 11 8 10 # 1 66 
T 4 8 3 4 4 4 4 
7 5 1 5 T 6 5 5 
24 14 25 13 22 18 19 16 
“ 
т LJ 
TABLE VIII 


4th Grade 5th Grade 6th Grade 
N Mean N Mean N Mean 
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TABLE VII 


THE INITIAL ASSIGNMENT OF THE 151 SUBJEC TS PARTICIPATING IN PHASE П 


6th Grade 


4th Grade 
Sequence “А” Sequence “В” 


Sequence “А” Sequence “нң” 


ТНЕ МЕАМ SCORES FOR THE VISUAL PART OF PHASE I FOR EACH GRADE 
IN EACH SCHOOL 


^ Combined 
Grades 


School N Mean 
11 4.36 31 4.5 


10 4.9 


4.2 
4.1 28 3.1 


287 


Totals 


38 
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procedures followed in learning the patterns ont 
in learning the song; each pattern was presen ү 
individually for both the reading апа ы. at- 
tem=sts while the song was always presented asa 
com plete entity. In order to minimize this differ- 
ence, the writer decided that following each read- 
ing trial of th. song, and prior to the Subsequent 
listening, the reading errors made by the subject 
would be pointed out and verbally described by the 
examiner. This decision was based on the analy- 
sis of the data from Phase I which showed that 
many subjects persisted in making the same er- 
rors because of an inabilit: 
their reading attempt as a 
with the item when it was р 
held true for items treat. 
be more true when the su 


recorders; b) one set of 
items, there being suffici i 

them for the next trial; and c)one5 x 8 index card 
containing the song, with the notati 


size as used in Phase I, The mast 


в, although eight над been 
considered sufficient, 

Processing the data was carried forward in the 
same way аз for Phase |, The data for both the 
patterns and the Song were transferred to the in- 
dividual data sheets and scores, in terms of cor- 
rect responses, were calculated for еасћ subject. 
Three kinds of scores were used: 

1. А “регсепі correct” 

for each item in the 

each item in the son 

fined as the tonal co 
‘Percent correct” = 

Number of trials 


item correct 
Total number 


of trials 


score was calculated 
five-item test and for 
в, the item being de- 
nfiguration: 


2. A “теап percent correct» 
culated for each task. Thi 
mean score: 

**Mean percent correct’’= 

Sum of Ше “% correct scores” 


= Оп each item” 
Number of items in task 


Score was cal. 
S was a Simple 


For both of these scores the unit ии 
served as Ше basis of the score was the гей 
dividual item. The first trial was not i 


cluded in the total number. of trials, аз ех- 
plained later. 


З. A “rate of learning" score was ашшы 
for each task and represented ап paces 
locate a single score which could ӨӨЛД” 
the true net gains made on the task € 
thesame time, take into consideration 5 
number of trials needed to make those gains. 
Theformula used for this “RL score" was: 

RL - true net gain x .100 


Total N of trials-1 


" -р 
The true net gain can ђе expressed as 27 
in which: 
sa) ident- 
а = Score on final trial, the final trial oni 
ified as the first of the two consecu 
ly perfect trials. 
b = Score on first trial. n 
€ - Perfect score, thescore assigned P any 
trial in which all items are correct. 


d 

The actual gain made by a subject is epre ia 
in the numerator, «53-5: the total gain pes Б 
for the subject is expressed in the denomina is 
““c-b.”” The result, which can never exceed 1, 
the true net gain, t the 

Since this true net gain does not reflec the 
number of learning trials taken by the subject, er- 
Second part of the formula was used, The ipee 
ator, 100, is used so the score may be Luo ages 
in larger units, The denominator indicates po 
tual number of trials during which learning С on- 
take place. The first trial for both tasks was са і 
Sidered a pre-test of previous knowledge, В task 
was assumed that learning the content of the 


ї m 
did not begin until the subject had heard exi after 
tent. In the case of the song, this occurred 22 


М an 
3 in the case of the tonal config 
urred during the first complete" 
ег the individual item had pee 
refore, this first trial was subtr? 


Subject had. done 00 
In I the scoring of the items was ided, і? 
a “right” or “wrong” basis. It was Чери Ён 
: 1 ture of the tasks, to use а X Thi$ 
ternative for Scoring the items in Phase П. xe. 


rong 


, 
ат! , 
the original item. For ех г 


5 
J locate the starting pitch апа for 
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one-half a correct response. This scoring pro- 
cedure gave data which more accurately described 
the music reading abilities of the subject since he 
was not penal ized for his inability to sing inde- 
pendently. There are а great many pupils in the 
elementary grades who can read and interpret 
music symbols witha high degree of accuracy but 
who need the support of an accompanying instru- 
ment to maintain perfect tonality and intonation. 
The writer did not, in this explor atory study, 
wish to disc riminate against the child for whom 
the task was already very difficult. 

In order to illustrate the entire scoring pro- 
cedure, a sample data sheet used for scoring the 
song is given in Figure 4, and the three scores 
are indicated as well as the process of deriving 
the scores. The tonal configurations appearing 
in the song are indicated with prackets, and the 
individual notes outside the bracket in measures 
4 and 8 are not included in the scoring. A value 
of eight points is given for each correct tonal con- 
figuration yielding a total perfect score of 48 for 
the six items. 


ту. Results of the Study 


А. Phase I: Visual 


ptained for 89 of the more 
than 100 pupils tested in Phase I. Since the sub- 
jects had been selected at random from the total 
fourth-, fifth-, and sixth-grade population ineach 
School, it was highly possible that some subjects 
would be tested whose data would not be usable. 
Such subjects would include the following: 


Usable data was 0 


1. Pupils who verbally expressed no desire to 
be tested; | 

2. Pupils who were unable to match aem 
and consequently would not be able to = s 
any singing response to either the visu 
aural stimulus; 

3. Pupils whose ability t 
heard was so poor that they У 
Sing morethan two correct response 
the ten items; 

4. Pupils who became 5 
testing that they were un 


o repeat what they 
ere unable to 
s out of 


o nervous during the 
able to continue. 


igual’? will 

the term «yijsual'" Wi 
a s made 10 the vis- 
the reading aspect 


b In discussing the dat 
us usedto referto the response 
al presentation of the items 


*Calculation of “t used the following formula: 


eph 


of the test. The term “авга!” will be used to re- 
fer to the responses made to the aural presenta- 
tion of the items—the listening aspect of the test. 

The data for pupils in each grade in each school 
were first tabulated to ascertain the kinds of 
scores whichhad been made. The basis of scor- 
ing was to count an item correct if a correct ге- 
sponse had been made for any two of the three 
trials. An item was scored wrong if it was in- 
correctly sung or if no trial or only a partial trial 
was made of the item. The score was the sum of 
the individual items correct, with a maximum 
score of 10. Table VIII s ummarizes the mean 
scores for each grade in each school. 

The data shows the mean score for the fourth 
grade to belower in each school than the mean 
scores for the two upper grades. The sam e dif- 
ference appears when the three schools are com- 
bined by grade level. The data also shows the 
mean score for the sixth grade to be higher than 
the mean score for the fifth grade in Schools B 
and C, but not in School А. When the three schools 
are combined, however, thereisa steady increase 
in the size of the mean from fourth grade through 
sixth grade. 

A testfor the significance of the difference be- 
tween the mean scores of each grade, combining 
the threeschools, gave F - 1.5068, and it may be 
concluded that there is no significant difference be- 
tween the three 8 rades in terms of the data of 
Phase 1 (Table IX). The five percent level of sig- 
nificance is used throughout th e study. This, of 
course, has significant implications if intensive 
testing at these levels continued to support these 
findings. A test for the significance of the differ- 
ences between the scores for the fourth and sixth 
grades, the grades to be studied in Phase П, gave 
a ж of 1. 8397 and the differences were consid- 
ered as not significant. 

Table УШ also shows the mean scores for each 
grade to vary from school to school. For the 
fourth grade, School A had the highest mean, with 
d Bin descendingorder. For the fifth 
15 A and С had identical means, and 
for the sixth grade School C had the higher mean, 
with schools A and C in descending order. This 
would appear to indicate that subjects in each 

sistently performed ata low- 


grade in School B con [ 
er level of achievement than subjects from the 
other two schools. The mean scores for the com- 


bined gr ades in each school shows Schools A and 
С to have overall means which are very similar 


1 
in which Dx? = үү NZ? - (22) ] 
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TABLE ІХ 


ANALYSIS OF VARIANCE BETWEEN GRADES FOR THE VISUAL 
PART OF PHASE I 


Degrees of Mean 


Sum of 


Source of 

Variation Squares Freedom Square 

Total 435. 8202 88 

Grades 14. 7545 2 7. 3113 

Pupils in Grades 421.0657 86 4.8961 
(Е=1. 5068) 


F” in Palmer О. Johnson’s Statistical Methods in 


See «Distribution of 
І, 1949), рр. 362-65. 


Research (New York: prentice-Hal 


TABLE X 


ANALYSIS OF VARIANCE BETWEEN SCHOOLS FOR THE 
VISUAL PART OF PHASE I 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square 

Total 435. 8202 88 

Schools 35. 2830 2 16. 6415 
402. 5372 86 4. 6807 


upils in Schools 
(F= 3. 5553) 


au „ы 
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j oint high- 
d, at the same time, more than one p | 
те: than the mean for School B. Analysis of уагі- 
ance yielded ап Е = 3.5553 whichis significant at 
the five percent level and one may say that a Sig- 
nificant difference exists between the three 
ls. 

м analysis of the data showed the follow- 
ing: 


1. There is no significant difference between 
the mean scores for the combined grades of 
School A and School С; “Û? = .0907. 

2. There is a Significant difference between 
the mean scores of School A and School B; 
wt (20004. 

3. There is a significant difference between 


the mean scores of School C and School B; 
“че 2.2946 


c) the music teacher; and d) 


Subjects in other Schools, 
Examination of the rating shee 
the music teacher Showed that 
placed in the lowest group in te 
ability were always the boys. 
received so low a rating. Thep 


ts completed by 
Pupils who were 


‘роу problem’? in music activi 
grade levels. There is however 


sequently for higher teacher ratings. 
The subjects for Phase I had been s 


and girls at each grade in each School. A sum. 
mary of the results on the visua] part of the test 
is given in Table XI. This shows the following: 


1. For School A, the boys earned higher scores 
than the girls in both the fourth and Sixth 
grade and these scores were very close to- 
gether in fifth grade. When the Scores for 
three grades were combined, the boys had 
a higher mean than did the girls, 

2. For School B, the girls earned higher 
scores than the boys in both the fourth and 
sixth grades. Whenthe scores Гог the three 


grades were combined the girls had a high- 
€r mean than did the boys. 

3. For School С, the girls consistently did bet- 
ter than the boys. 

4. For the three schools combined, only the 
fourth grade showed little difference between 
the boys and the girls. | 

9. For the combined grades in the combined 
schools, the boys had a mean of 3. 67 and 
the girls had a mean of 4.39. 


The <t’? test of Significance gavea "EC of 
1.5457 and it may be concluded that there is T 
significant difference between the mean scores 0 


the boys and girls when all grades and all schools 
are combined. 


1. Analysis of Reading Еггог5— A considerable 
amount of time was spent — 


as Spent in making a detailed 
analysis of the reading errors made on each of the 
ten items of the test. The purposes of the analy- 
SiS were: a) to identify any Common errors which 
Occurred; b) to attempt to identify the progress 
Shown by the Subject for the three trials; and с) 
to provide nec essary data for the Пеш-апа!у515 : 
The data for the item analysis has been p 
regarding the progresso 
t considered pertinent to the 


he analysis of reading errors 
will be discussed at this time. 


no trial" response 
по attempt to sing a response 
mulus or failure to Fonros i 
WO notes of the item. Table 25 
65ропвев made by the 89 „У 
ет оп the test. The reader i 


table Shows the f 


l; Fourth-grade pupils had twice as many Жо 
trial" responses аз either of the other ЕУ 

grades. Fifth and Sixth grade “по tri 

responses are very similar. es 
е mean number of ““wrong’’ respons 


f 
for sixth grade 15 slightly lower than that 0 
the other two рг 


grades, — 
3. The mean “Tight” responses shows im 
Provement Т 


һе 
' rom grade to grade although = 
differences are not Statistically signific 


mean *'' right? responses. 16 
› 3, 4, Тапа 10 had far fewer = 
te Sponges than the other five items 0 
est. 


97 


ht 
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The scope of this report is such that a de- 
tailed discussion of all reading errors for each 
item would prove impractical. The writer pro- 
poses, therefore, to present а brief summary of 
the errors made most frequently by the 89 sub- 
Jects, using tonic sol-fa syllables for identifica- 
tion. A dash below the syllable indicates that the 
Pitch is lower than the preceeding tone; a dash 
above indicates that the pitch is higher. 


Item 1: (FM RD) 
Within the 98 errors made by th 
grades, the following appeared most frequently: 


e combined 


1 FRTS 10 4) FSLF 5 

2 ЕВТЕ 5 5 FSLT 5 

3) FRMD 5 6) FMeRD 4 
7) FRDTe 4 


Total 49 


The remaining 49 errors had frequency counts of 
: То 3, 25 errors having a frequenay count of only 1. 
It is inte resting to note that Error 5 is an exact 
inversion of the pattern and Error 4 а partial in- 
Version; that errors 1, 2, and 8 are descending 
Chord figures; and that Errors 8 and 9 began on 


the wro i i that the actual 
n te the fact tha 
we pen seconds. In 


starting pi two 

g pitch was sounded for ћ 1 
fact, 27 of the 98 errors included failure to begin 
On the correct pitch. The scores for the E 
Srades are very similar although the fourth grade 


ад more “по ла? responses. 


е combined 


erate ih De DL E ant frequent 
1) рт, ТІ 4) DSL 8 
2) рам 44 s DSF 9 
3) DLF 26 в psp 8 
7) DIS 6 
в) DTeL 4 
Total 177 


ercent 
Thi eight errors account for 177, ог 84 P 


of thetotal wrong responses. Of remaining 34 er- 
rors, 17 had afrequency of 1, four had a frequency 
of 2, and three had a frequency of 3. Itis inter- 
esting to note that Errors 1, 2, and 3 were the 
most common. Children do not appear to notice 
the interval of the minor third followed by the ma- 
jor second since they sing the pattern either as a 
partial scaleorasa complete chord figure. There 
were only 37 correct responses out of a possible 
total of 267. Itis of interest to note that the sixth 
grade had more **wrong" responses and fewer 
“right” responses on this item than either of the 
other two grades. Only17 errors were caused by 
failure to start on the correct pitch. 


Item 3: (SMDR TD) 
The errors appearing most frequently in the 
total of 218 were: 


1) SMDMDM 14 4) SFMFRM 10 
2) SFMFMR 13 9 SFMRDT 7 
3) SFMSFM 13 6) SFMFSF 6 


7) SMDRSD 6 


8) SMDMRD 5 


Total 19 


unt for only 79, or 3.6 
most of the remaining 


ing frequency coun or 2 
перу | ihn most complex because it included 
i sandwas very disjunc 
Ё m сул that it had the fewest number of cor- 
5 The same held true 


responses of any item. п 1 с 
a the ісі responses to the item; it again had 


er of correct responses. 

фео ашт 8 апа 9 show а partially correct 
response being made, pres um ably because the 
first part of the pattern was familiar and could be 
d better than the last three notes. Er- 
plification of the pattern so it be- 
while the other errors enm 5 m 
а 3-tone pattern. Failure to be- 

“гоп Hexe fe pitch accounts for only 32 of 


the errors. 


Item 4: (DSL TD) 
ding interval of 

is item be! an with a descendi 
This ite timed with an ascending scale pat- 
d to the starting tone. The in- 
i t one to Sing and this is evident 
га, The 9 errors шові іге- 


ted for 100, or 57 percent of 
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the total ‘‘wrong’’ responses. 


1) DTLSF 32 4) DDRMF 13 
2) DTLTD 19 5) DMFSL 8 
3 DLTDR 16 6) DLSFM 7 


7) DFSLT 5 


Total 100 


istic was also true fi 
errors which had fr 
Sixth grade had cons 


or most of the remaining 76 


20r3. The 
right" re. 


Item 5: (DMS) 


This is one of the more famili 
the test, a fact reflected in the 1 
‘‘right’’ responses by all grades, 
in which the pupils in the fourth and 5; 
duplicate each others performance, 
had only 85 *wrong" responses, arec 
ed only by item 9, and one error, *DRM» 
a frequency of 53, or 62 percent о 


ar items on 


had 
f the total, In 


really read the 
* A total of 13 


€ to begin on the cor- 
rect pitch. 


Item 6: (SLSF M) 


This combination of neighboring tones and 
a partial descending scale Was easier for the Sixth 
grade than for fourth and fifth, Whose scores are 
similar. A total of 62 different errors were 
counted and the 7 mostfrequently made accounted 
for 15, or 49 percent of the total number of 
**wrong'' responses. These are: 


1) SLSMD 20 4) SMFRD 9 
2) SFMRD 15 5) SLFMR 8 


3) SDSMD 12 6 SLSFR 6 


7) SLSMR 5 


Errors 1, 6 and 7 showa partial success in read- 
ing the item, while Error 2 results again from 
attempting to use a familiar response to ШИШЕ, 
which is not Carefully observed. Errors 3, 4 an 
5 represent inaccurate responses whieh арт 
Strongly influenced by the aural present ation 0 
the item in the trial which preceded this d 
lar item. Only 12 errors were identified as fai 
ure to begin on the Correct note. 


Item 7: (SD M) 


This item, an inversion of the more fam 
lar item 5, contains within its first two notes ga 
of the most common tone patterns identified EE 
Songs. It ranked first in the analysis made by 5 

i Smiley’s analysis, and is in variab F 
found in songs which have anacrusis, or “pick 
шр поје." Despite this, the faci that it pr 
with anascending fourth presents a definite геа! 


1) STR 49 5) sTD 8 

2) sLT 36 6) SRF 5 

3) DRM 10 7) SES 5 

4) SLD 9 8) TRF с 
9 том 4 


NS — 


ш. 
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for 46 percent of this same total. The tendency 
to reduce the interval of a fourth to an interval of 
a third and proceed to sing а tonic chord is evi- 
dent in Error 1; the tendency to revert to a scale 
pattern whenever the reader is in doubt is shown 


in Error 2. 


Item 8: (MRD) 


This is one offour items for which the num- 
ber of “де? responses exceeds the number of 
‘wrong’? responses. The fourth- and fifth-grade 
pupils perform almost the same way, with the 
sixth grade showing better res ults. Only 17 of 
the 88 errors resulted from beginning on the wrong 
pitch, The familiarity of this pattern is estab- 
lished by the scores. The common errors are: 


1) MDS 24 4 MDL 5 
2) MFS 10 5) MFR 5 
3) sMD 6 6) SFM 4 

Total 54 


These errors account for 61 percent of the to- 
tal and it is interesting to note the frequency with 
which this simple scale pattern becomes a de- 
Scending tonic chord, Error 1 has a frequency of 
24, or 27 percent of the total number of errors. 
The reader is referred to the analysis of Item 5 


which shows parallel results. 


Item 9: (LS FMRD) 


This six-note descending scale proved to be the 
lgra des appeared 


easiest item of the ten, and 

to perform with equal ability. There were no er- 
rors having afrequency greater than five although 
a total of 37 different errors were counted. Fail- 
ure to begin on the starting tone did account for 
20, or 35 percent of the total errors made. 


Item 10: (SMFR) 

The disjunct charac t m шабе 1. 
m difficult than one WO, gu 

ence of descending thirds occur Е Ё 
Songs. Again, very little diffe rence exists be 
tween the fourth and fifth grades тИ total of 
grade achieves slightly better scores. 
39 different errors were tabulated am of six OF 
Our given below had freque ncy counts O 
ess; 


1) SFMR 27 


2) sMSM 14 


Error 1араїп represents this phenomenon of sub- 
stituting a familiar tonal configuration for an un- 
familiar one, possibly the result of carelessness 
but more often resultingfrom the subject’s inabil- 
ity to ‘“‘hear’’ the pattern pefore itis sung. Only 
nine errors of the 182 were caused by an incor- 


rect starting pitch. 


2. Summary of the Findings—This detailed an- 
alysis of reading errors emphasizes several im- 
portant points. The total of 1494 “wrong” and 
144 “по trial" responses as compared with 1072 
“right?” responses shows that children in the ele- 
mentary grades experience considerable difficulty 
in reading tonal configurations which are com- 
monly found in the songs children sing. 


1. During the testing period, the writer found 
that most children read the item very slow- 
ly, а single note at a time. This is not ap- 
parent in the data because there is no way of 
indicating this kind of performance other 
than to listen to the tape-recorded responses. 

2. Although children expe rienced little diffi- 
culty in makinga response to the aural stim- 
ulus they were seemingly unable to “таке 
the voice go the way the notes do,’’ as 5еу- 
eral subjects pointed out. This implies that 
skill in reading music can be developed only 
if the child is able to “hear” the item si- 
lently before he sings it. Developing this 
internal “hearing” presents many problems 
including an adequate identification of effec- 


tive teaching pro 


3. Children are usu 
shape or direction of the tonal configura- 
tions but fail to acquire the ability to accu- 
nal changes which 


rately identify the inter 
make one pattern different from another 


even though the two patterns appear quite 


similar. 


4. When children are asked to read tonal con- 


figurations which are not immediately rec- 
ognized as something they know, one of two 
types of responses generally result: 1) chang- 
ing the pattern 50 that it becomes a famili- 

chis very similar to the stim- 
essing that the response 
thing like the stimulus 
sound basis upon which to 
f their response. The 
ber of diffe rent errors which the 
children made on the 10 items in evidence of 
the second kind of response. Excluding the 
по trial’? responses, 489 different errors 
were made in accumulating the 1454 ‘бугопе” 
responses, giving an average frequency of 
2.97 per error. Evidence of the first type 
of response is indicated by the fact that 61, 


ог 12.47 percent of the 489 different errors 
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had a total frequency count of 787, or 54 
percent of the 1454 ‘‘wrong’’ responses, 

5. The data from Phase I shows comparatively 
little difference between grades in terms of 
general ability to perform the task. Part of 
this might be due to the fact that three trials 
of the 10 items did not give Ше Subjects suf- 
ficient opportunity to learn the items. The 
question arises as to whether more trials 
would have shown significant differences be- 

tween the three grades. Since most music 
reading activities begin at the third-grade 
level, this means that sixth-grade children 
have two additional years of experienc: 


as to why this is true, 


B. Phase I: Aural 
——- Aural 


РагаПе пе the 
an analysis of th 


ble to work with famil? 
iar concepts and materials, 


The data on the singing responses to 
presentation of the item was processed in 
Way as the data for the vis ual part of 


the aural 
the same 


any two of the three trials. An item 
aS wrong if it was incorrectly sung 
ог only a partial trial had been made on the item, 
The score was the sum of the individual items 
correct, with a maximum of 10, Table sum- 
marizes the mean scores for each grade in each 
school. 

The data shows that fourth- and fifth-grade 
subjects, in each school and for the Combined 
Schools, had lower m ean scores than the sixth- 
grade subjects. Only in School B was there a 
marked difference between the means of fourth- 
and fith-grade subjects. These results are sim- 
ilar to the results on the visual part of the test in 
terms of the relative achievement of each grade, 
The reader may wish to compare these means 
with the means on the visual part presented in 
Table УШ. Such a comparison Shows that Pupils 
in each grade earned a score on the aural Part of 
the test that was approximately four points higher 

than the Scores for the visual part. In other 
words, they were able to double their achieve- 


Was counted 
Or if no trial 


ment when an imitative, rather than a reading re- 
Sponse was required. 

д A test for the Significance of the differences 
between the mean scores of each grade, combin- 
ing the three schools, gave F = 1. 6297 and it may 
be concluded that there is no significant difference 
between the three grades interms of this data (Ta- 
ble XIV). 

This may well indicate that the ability to re- 
spond to an aural stimulus is already developed у 
the fourth-grade level and does not change signif- 
icantly after that time. h 

Table ХШ shows the mean scores for с 
grade in School A and С to be very similar Б 
that subjects in School В consistently pe rforme' 
at a lower level of achievement, especially when 
the fifth grades are compared. However, UA 
Sis of variance between the schools gaveF - 2.731 
and it may be concluded that no significant differ- 
ence exists between the three schools when sub- 
jects are asked to respond to an aural stimulus 
(Table xv). | 

Тһеге 15 по геазоп to suspect that boys differ 
Significantly from girls in ability to respond to an 
aural stimulus except the difference which may ГА 
due to the general inclination of the boy to avoi 
using his singing voice. The combined mean score 


for the 43 boys tested was 8.023 and for the girls 
it was 8.587. 


ср? 


, These data would tend to support the point-of- 
view that audit 


t a 
огу perception, as measured by 
Singing response to an au 


developed by the time a с 


nging Errors—A detailed ап 

analysis of the singing are made on each of the 
ten items was carried 
Cedures изеа 
rors. Ther 
Subjects had i 


in whether Subjects with high scores on the aur 
Part of the test 


the visual Part. Th 


were computed ang resulted in the following: 


1. Sixth grade, r=.64 
2. Fifth grade r= 10 
3. Fourth grade r=.58 
4. Combined grades, r= .64 


7 : ine 
The writer 15 unable, at this time, to determi 
Whether these Could be considered high, mediu 


от low correlations in terms of the two skills Р” 
ing Considered, тұ is of interest to note that fi 


д 
— 
—— —Ó 
— шшш 

А: 
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TABLE ХШ 


THE ME 
AN SCORES FOR THE AURAL PART OF PHASE I FOR EACH GRADE IN EACH SCHOOL 


4th 5th 
6th с і 
School N Mean N Mean N Mean N шиг” 
ean 
10 8.5 10 8.5 11 8.8 31 8.6 
и : 
9 7.8 9 7.0 10 8.4 28 7.8 
11 8.3 9 8.4 10 8.9 30 8.5 
Total 30 8.2 28 8.0 31 8.7 89 8.3 
TABLE XIV 


ANALYSIS OF VARIANCE BETWEEN GRADES FOR THE AURAL 
PART OF PHASE I 


Degrees of Mean 


Source of Sum of 

Variation Squares Freedom Square 

Total 219. 1910 88 

Grades 8.0039 _2 4.0020 
211.1871 86 2.4557 


Pupils in Grades 
(Е-1. 6297) 


КЕБЕ eee 


TABLE XV 


LYSIS OF VARIANCE BETWEEN SCHOOLS FOR 
чє PART OF PHASE 1 


THE AURAL 


Degrees of Mean 
Square 


Source of Sum of 
variation Squares Freedom 
88 


Total 219. 1910 
6. 5598 
Schools 115-1195 ES T 
gs 206: 0715 86 | 
(е = 2.1376) 


Pupils in Scho 
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grade children are appar ently more consistent 
than children in the other two grades, with the 
fourth grade not maintaining as high a str aight- 
line ‘‘relationship.’’ This is a problem which 
needs further investigation before any Sound соп- 
clusions can be drawn. | 
The data presented in Table XVI Summarizes 
the responses of the 89 subjects, these responses 
again being considered s eparately for each trial 
so that а perfect scoreonall ten items is 30. The 
table shows the following: 


1. There were very few “по гізі? responses, 
only 16 for the three grades, with the Sixth 


spectively. 

3. The mean “тірі” 
Sixth grade to b 
other two grades. 


4, The mean “right? responses for al] grades 
15 almost five times greater than the Mean 
“wrong” responses. 

5. Only item 3 shows the “wrong” 

i 5 responses 
ке the “right” г €Sponses, а result 
attributed to the length and complex; 
the item. enar 


responses show the 
е Slightly better than the 


This clearly points out the comparative 
difficulty of the two aspects of the test. 


The writer does not 


reading errors for this test gave 61 


and accounted for 54 percent of the 
of **wrong" responses. Forthe aur 
only 22 such common errors were noted and these 


‘‘wrong’’ responses, Опе reason for the lack of 
common errors lies in the differen 
two tasks. For the visual part of the Study, with 
the subject singing the starting Pitch immediate] 
after it had been given, failure 
correct pitch accounted for 1,4 Percent of the to- 
tal errors; for the aural part of the Study it ac- 
counts for 39.3 percent of the total errors, This 
inability to locate the correct starting pitch for 
items presented aurally is probably due to the 
fact that the subject had to retain this pitch during 
the entire aural presentation, so m ething which 
many of the pupils were unable to do. 


2. Summary of Findings— The analysis of SIBE 
ing errors which has been briefly presented shows 
that aural imitation of an item is considerably 
easier for children than making responses to visu- 
al stimuli. While 04.5 percent of the responses эч 
the visual presentation of the items were **wrong 
responses, only 17.5 percent of the responses to 
aural presentations were "wrong". This again 
emphasizes the difference in the difficulty of the 
two kinds of responses. The ‘““wrong’’ responses 
to the aural presentation of the items showed few 
common errors and this suggests that guessing 15 
taking place. It appears unlikely that music read- 
ing skill can be developed by continuing to empha- 
Size rote-learning and imitative singing in the up- 
Per grades. Thedata show that little difference 
exists between grades in terms of this auditory 
Perception. The major problem centers around 
the methods used to help the Child relate this aud- 


itory Perception to the visual Stimuli so the latter 
are meaningful, 


Originally 151 subjects had been chosen for this 
aspect of the Study but several of these were not 
available for testing and So data was secured for 
138 fourth- and Sixth-grade Pupils. Table хуш 
Shows the distribution of these Subjects according 
to school, grade, and the Sequence to which they 


Strument Played and 
Studied, ы и: 


Table XIX summarizes the information regar d- 


ing age and I. Q. , of the А . the figures 
Siven being the m Subjects, with 


This Shows relatj 
Sach group in the 
ег difference be 
Sroups intermg of «оз " group 0 
each grade hasa h 10. The axe 


erage” &ГОйр in th 
It is of interest tg 


jects in Schoo] C ig i Елит of the 
ther 
Other two Schools higher than in ei 


: 3 wer 
th School A slightly lowe” 
than School p, у.“ os 
. » . 5 рееп р н 
Sible, within the ав Шао, it has not 


t 
either the 5; 51111 cance of the differences OF e 
possible effect I.Q., might have upon the subject 
performance in the tests or Phase II. game 
on the topic tends to Show that a high level of ы 
tive musical capac ity is not necessarily acc? 
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i imilar superiority of intelligence. 
[a БО available "ей 
the possible effect I. Q., might have Fee ре 
p ынд Hid Н І. О., and 
етер extent of this T rela- 
tionship needs to be determined by careful re- 
E es is reason to believe that experience in 
playing a musical instrument results in а higher 
level of music reading ability because instrument- 
al performance tends to crystallize the concepts 

of pitch and tonal relationships as expressed by 
musical notation. This information Was Secured 


for each subject participating in Phase II and is 
summarized in Table XXa. 


S that more “gifted? 
ning than “ауегаре” Sub- 


-Егайе subjects 


tal fourth-grade 
Sample; the 37 Sixth-grade Subjects are 48.7 per- 


ample, ” as in Table ххь. 
~ Confi 6 urations— The 
results of the testing in Р 


hase II will be present- 
ed in terms of each task, rather than in terms of 
each sequence of tasks, inorder t 


Two kinds of SCOres will beu 
the results; the “percent Correct" sco res, and 
the *'rate-of-learning^ score. These have been 
described earlier in the report and it wil] be re- 
membered that **percent correct” score for each 
item was calculated by summing the number of 
correct responses to the item and dividing by the 
total number of trials for the item; Summing the 
“регсеш correct" Scores for each item and dj- 
viding by the number of items Save the “mean 
percent correct"' score for the total task, 

One of the first Steps taken was to ascertain 
the reliability of the tests used in Phase П. The 
subjects in each of the two sequences at eachgrade 

] were divided into two equal SToups, assign- 

ан to groups being made at random. The “ау- 
© апа ‘‘gifted’’ pupils were combined, as 


were the three schools. Ifthe differences а 
the means for each of these two groups 5 d 
significant it could be assumed that the tes 4 дар 
reliable, and that any Other differences we ea 
to factors other than the tests. Table XXI жең 
sents the data for both the learning of ин ed 
figurations and the learning of the song. T d 
values for the split groups in e ach grade pet 
each sequence show that no real difference e side 
between the means and the tests may be Tom 
ered reliable for this particular sample. Its = 
be pointed out that ће scores on the song ү ie 
Sent the percentage of correct responses 
tonal configurations within the song. — 
Analysis of the **mean percent correct ё ad d 
for tonal configurations that were presente ЕМ 
Separate task is summarized in Table XXII. t all 
highest possible score is 100, indicating tha ihe 
five items were correct on every trial taken by be 
Subject. The reader is reminded that for а 7 
А this was the initial task, but for Sequence che 
was the second task and was performed „ш 
Subject had had practice inlearning the song. Vers 
igher means for average and gifted subjec A 
each grade who were tested in Sequence B ai, 
probably due to this practice which they receiv 


” -hows 
Examination of the individ ual tabulations sho 
that a total of 12 fourth- 


grade pupils had 


were more 
quence В. 


: su- 
Table XXII shows the r Meer ph 
ifted subjects as compare 


5 
ng the study. Of more es 
Snificance is the Map ape 
each SToup іп Sequence A With its parallel Е hich 
in Sequence B to ascertain the degree to на: 
РгасЦсе опа related task influences test SCO 
The <> tests are Shown at the top of 
These data appear to support the following СО? 
Clusions: 


. age 
ds Practice iS more necessary for pes 
fourth-grade Pupils than any other Е and- 

is STOUp needs to develop an under 


. 2 m с 
Ing of the concepts involved in reading 
sical notation. 
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Sequence A 
Sequence A 5. 6th grade, average, 

3 e ае н Sequence В 6. 6th grade, average, Sequence 5 

3 ves свае gifted, Sequence A 7. 6th grade, gifted, Sequence 

+ 4th не gifted, Sequence B 8. 6th grade, gifted, Sequence 

2 > 


112 = 3.4689, there is a significant dif 

level. : : 
134 = 1.9550, there is no si 
t56 = 1.6422, there is no sj 
178 = .1897, there is по si 


ference between the means at the .01 percent 


gnificant difference between the means. 
gnificant difference between the means. 
gnificant difference between the means. 


2. Gifted pupils in both grades already appear 
to understand these con cepts and derive 
comparatively little benefit from practice 
since their scores are alr eady relatively 
high. 


3. Average sixth-grade pupils do not show sig- 


writer was particular: 


The ily interested in at- 
tempting to identify the rat 


e at which pupils in 


cluded those subjects of 

learning would be possible. 

e no gains from 

the initial trial to the final trial or who 
earned a lower score on the final trial than 
on the initial trial, received an RL Score of 
0. These subjects are included in the data 
because the opportunity for learning was al- 
ways present. Thereason for assigning a 0 
to the subject whose score became poorer 
was a device to avoid the problems inherent 
in working with negative values, 

3. The RL score is based on the total test and 
is not, as is the case with the mean “рег- 
cent correct" score, ап average of the per- 


formance on each item. This works to the disad- 
vantage of the subject who requires many trials to 


arning the song, the 
nal configurations as 


the difficulty encountered in Scoring the tasks in 
this study is duetothe complex nature of the tasks 
themselves. Research activity in the field of mu- 
i alling attention to the existence of a 
em, fails to recommend more ade- 
quate methods of Scoring. " 
The highest RL score а subject could earn is 
100, possible only if the su 
ond trial on the test to 


€ XXIII summarized the RL scores 


group in Sequence 
gifted pupils in Six 


tthesign 
tween the means of the “percent correct’? scores; 
а result which was p 


redictap] e of the sim- 

ilarity of the бох ble becaus 
The reader will 
ferences in m ean 
grades had oj 


recall that for Phase I the ed 
SCOres for fourth and S 
Of 1.8397 and it was conclu 


en 
Verage and gifted or betwe 
the two sequences. The t test E. a tof .9169 


чан 


_ Ња 
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#12 = 1.8036, there is 

{34 = 2.1846, there is 
level. 

t56 = . 5356, there is no signifi 

{78 = . 8592, there is по signifi 


These data parallel the results of the “t” tests 


made to test the significance of the differences 
between the means of the ‘‘percent correct’? scores, 
a result which was predictable because of the sim- 
ilarity of the two kinds of scores. 

The reader will recall that for Phasel the dif- 
ferences in mean scores for fourth and sixth 
grades had given a t of 1.8397, and it was con- 
cluded that the differences were not significant. 
The 138 subjects inthe present sample were con- 
Sidered only in terms of grade; no differentiation 
being made between average and gifted or between 
the two sequences. Thet test gave a t of . 9169 
and it was again concluded that the difference be- 
tween the two grades, interms of mean RL scores, 
Was not significant. The mean for the fourth grade 
was 26.17; for the sixth grade, 31.67. 

It will be remembered that in Phase I the 


Scores for the three schools showed à significant 
School B and the other 


difference existing between 

two schools, but that the differen? e between 
Schools A and C was not significant. The sample 
for PhaseI was a random sample while the sub- 
jects for Phase П represented two distinct levels 
of competence, with only the averag? group 5е- 


lected on a random basis. To ascertain whether 
he lowest in terms of 


School В continued to rank t : 
achievement, the RL scores Оп the tonal configu- 
ations for the three schools, combining grades 
and groups, were analyzed. ‘Analysis of variance 
Showed that there was 4 significant differenc® br 


tween the thr level, F = 4. 
ree schools at the 5% level, 3 

This data is presented in Table XXT The “t” 
eof the differences be- 


renis for the significanc 4 
ween schools showed the following: 


1. Between Schools А and р, nam and 
the difference is not significant. 

2. Between Schools А and C, t= 2.9211, and 

ificant at the one per- 


the difference is sign 
cent level. 

3. Between Schools В an 
the difference 15 


ас, t= 1. 764, and 
not significant. 


The means of these three schools were: S 
A, 21.47; School B, 26.99; and School C, 42-41 
The data tends to support the following cone 
Sions: 


4th grade, average, Sequence A 5. 6th grade, average, Sequence A 
4th grade, average, Sequence B 6. 6th grade, average, Sequence B 
4th grade, gifted, Sequence A 7. 6th grade, gifted, Sequence A 
4th grade, gifted, Sequence B 8. 6thgrade, gifted, Sequence B 


no significant difference between the means. 
a significant difference between the means at the . 05 percent 


cant difference between the means. 
cant difference between the means. 


1. Subjects in School С performedata high 
level of competence in both Phases I and П. 

2. Subjects in School A performed similarily to 
those in School C for Phasel but on Phase II 
their performance more closely paralleled 
that of School B. 

3. Subjects in School B ranked lowest in per- 
formance on Phase Ibut paralleled School A 
on Phase Il. 

4. The change in position of School A for the 
two phases may be due to the differences in 
samples. Itis also possible that Phase I 
represents a more accurate evaluation of 
the pupils in the sc hools than did Phase I. 
If this is the case, then it may be concluded 


that subjects in Schools A and B are similar 
ool C are different, 


while the subjects in Schi 
in terms of gene ral competence in reading 


music. 


r Sequence A, the 


2. Test Results: Song—Fo 
period of the 


song followed a 1 -minute learning 
for Sequence B itwas the 


five tonal configurations, 
the “percent correct" 


initial task. Analysis of 1 
0 the highest possible score, 


scores, with 10 
showed а total of only 10 perfect scores. It will 
be remembered that in scoring the song, the score 
unit was the tonal configuration within the song. 
ubjects, one in each se- 
quence, and 8 gifted sixth-grade subjects in Se- 
quence A account for the perfect scores. This is 
a partial indication that learning a song is a more 
difficult task than learning individual tonal config- 
urations, for which 32 perfect scores were ob- 
tained. e data in Table XXV, again, indicate 
the superiority of the gifted subjects. Compari- 
sons between the groups for the two sequences 


show the following: 


1. For the 4th grade, average, Sequence A with 
Sequence B, + = 3.0018, indicating that the 
difference between the means is Significant 
at the one percent level. 

2. For the 4th grade, gifted, Sequence A with 

B, t= 2.0170, indicating по significant dif- 
between the means. 

6th grade, average, Sequence A with 


9728, indicating a significant differ- 


ference 
3. For the 
В, 1-5. 
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ence at the .001 level. 

4. For the 6th grade, gifted, Sequence A with 
В, t= 4.5811, indicating а significant dif- 
ference between the means at the . 001 level. 


These results support the following conclusions 
regarding the importance of practice in learning 
to read a song: 


1. Only thegifted fourth-grade subjects did not 
appear to be influenced by the presence Or 
absence of practice on tonal configurations 
prior to learning à song; thisis, of course, 
true only for this particular sample. 

2. Forallother groups, especially at the sixth- 
grade level, prior practice on tonal config- 
urations resulted in significant improvement 
in reading the song. This was not evident 
when the scores On tonal configurations 
were compared on the pasis of prior ргас- 
tice onthesong. This tends to indicate that 
the reading of songs should be preceded by 
some prior related activity, not necessarily 
learning tonal conf igurations, 50 the sub- 
ject moves from the simple to the complex 
and is aware of what is taking place. 


No attempt was made to comp? re the scores 
for the song with those of the tonal configurations, 
because, in view of the different nature of the two 
tasks, such comparisons could not be completely 
Justified. 

The “rate of learning" scores for the song 
were analyzed in the same manner as those for 
the tonal configurations. Only two gifted sixth- 
grade subjects who were in Sequence A had a per- 
fect score on the initial trial of the song. Eight 
additional gifted subjects had an RL score of 100, 
as compared with the 21 subjects having RL scores 
100 on the tonal configurations: Table XXVI gives 
the data for the eight groups; and it is again eV17 
dent that gifted subjects perform at a much high- 
ег level. With the exception of the fourth-gT ade 
gifted subjects, all groups in Sequence A showe 
higher RL scores than those in Seque | 
One compares Table XXVI with Table ххш it may 
be seen that the RL scores on the song are con- 
Siderably lower than the RL scores on the tonal 
Configurations. Thet tests between the various 
groups com paring the two sequences showed the 
following: 


1. For 4th grade, average, Sequence 
{ = 1.0677, indicating no signifie 
ence at the 5 percent level. 

2. For 4th grade, gifted, sequent $ bn. 
t= .1847, indicating по significant ша, А 

3. For 6th grade, average, Sequence 5 УЛ” "i 
t =7=.6322, indicating а significant pue. 
ence at the . 001 percent level. | 

4. For 6th grade, gifted, Seque differences 


e A with B, 
ant differ- 


9 Ge net t the 
t= 2. 6444, indicating US i gnif icant at 


between mean RL scores is si 


the .02 percent level. 


The results agree, in general, with the t tests 
made forthe “percent correct’’ scores on the 
song and would indicate that the more musically 
mature and experienced sixth-grade subject is 
better able to profit from certain types of related 
practice than the fourth-grade subject. 

The four groups in each grade were combined 
into a single group and the RL scores for the two 
grades compared. Thedata 5 ho wed a mean of 
13.15 for fourth grade; 21. 19 for sixth grade. The 
t test gave at of 2.0532 andthe difference between 
the means of the two grades was considered to be 
significant at the 5 percent level. This would al- 
so support the conclusion that the maturity of the 
sixth-grade subject gives him an advantage when 
the rate of learning is considered. 

It has been established thatfor Phase 1 and for 
the tonal configurations of Phase I, the differences 
between the three schools was significant. The 
means of the RL scores on the song, combining 
all groups within each school, were as follows: 
School А, 19. 09; School B, 17.48; and School C, 
19.10. Analysis of variance showed that these 
differences were not signifi с ant at the 5 percent 
level, Е = . 4182. 

It may be concluded that the samples from the 
three schools vary considerably when comparisons 
are made in terms of achievement scores, such 
ecting previous musical training and 
experience; but that these same samples do not 
vary significantly when comparisons are made in 
terms of the ability of the subjects to learn the 
song. The tasks which servedas a basis for com- 
paring schools in Phase I and the tonal configura- 
tions of Phase П evidently gave the subjectsa 
greater opportunity to utilize previous experiences 
than did the learning of the sons. This also sup- 
ports the gener ally accepted point of view that 
reading а song is a highly complex process. 
esults: Identity of Tonal Configura- 


this aspect о 
score for eac 


test and the same SCO 
oment correlations for all groups 


on each item were © alculated and the basis for 
comparison is the size of the resultant p." The 
raw data will not be used in this section of the re- 
portin order to simplify and condense the presen- 
tation. For the same reason, no attempt is made 


to retain the identity of the individual schools. 


preliminary correlations for the complete test 


were calculated, with the percent correct" score 
for the five-item test СО mp ar ed to the ‘‘percent 
correct" scores for the tonal configurations with- 
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in the song. This procedure gave the following 
correlations for the combined schools: 


1, Ші average, Sequence А т = .32 
2. 4th, average, Sequence B Pa. 
3. 4th, gifted, Sequence А г-.42 
4. 4th, gifted, Sequence В r-.44 
5. 6, average, SequenceA  r- 58 
6. 6th, average, Sequence B г-.91 
7. 6th, gifted, Sequence А  r-.4l 
8. 6th, gifted, SequenceB г=.60 
9. 4th, average and gifted, 

Sequence A р „Та 


10. 4th, average and gifted, 

Sequence B (2.14 
11 6th, average and gifted, 

Sequence A r= то 
12. 6th, average and gifted, 


Sequence B yos „01 


It will be noted that the correlations for the fourth- 
grade average and the sixth-grade gifted are con- 
siderably higher for groups in Sequence B than for 
those same groups in Sequence A. The reverse 
is true for the sixth -grade average groups, and 
there is little difference between the two sequences 
when the fourth-grade gif ted groups are consid- 
ered. When Ше two ability groups are combined, 
the differences between sequence for the fourth 
стаде become less evident and the sixth grade 
shows a higher correlation for Sequence A. It is 
interesting to note that these correlations are 
higher than those obtained for the individual abil- 
ity groups. For Sequence A, fourth-grade aver- 
age was . 32 and fourth-grade gifted was . 42, but 
when the two groups are combined the ‘‘r’’ be- 
comes a.72. It is the opinion of the writer that 
these higher correlations resultfrom the fact that 
the average subjects are at the lower end of the 
distribution and the gifted subjects at the upper 
end, thus yielding a higher correlation than when 
each group is treated separately. 

These preliminary correlations show that there 
is a tendency for tonal с onfigurations to retain 
their identity. Inorder to present more complete 
data, the same kinds of correlations were calcu- 
lated for each of the five tonal С onfigurations 
used in Phase П. These will ђе discussed indi- 
vidually and then summarized at the conclusion of 
the section. In preference to а summary table, 
the means for each item, together with the cor- 
relations, will be presented at the time each item 
is considered. Mx will designate the mean of the 
item when presented in the five-item test, and My 
Will refer to the mean of the same item as it ap- 
peared in the song (see top of next column). 

This item was the first meas ure of the song 
and one would anticipate, because of its unique 
Character, that subjects would experience little 
difficulty in recognition. The m eans of the var- 
ious groups show that sixth-grade subjects are 


Sequence A 


Average Gifted 
Mx My Mx My 
4th 23 66 T1 89 
>= „10 re 208 
6th 49 87 78 99 
r=. 45 = SODH 


Sequence B 


Average Gifted 
Mx My Mx My 
4th 14 64 94 87 
r= -.56 Г = .44 
6th 72 61 88 89 
r= ol r= -.04 


more familiar with the item than are fourth-grade 
subjects, the ехсе ption being the average sixth- 
grade group in Sequence B. There is considerable 
variation for the correlations and only three of 
these appear to show that subjects recognized the 
item when it appeared іп а context different from 
its initialpresentation. For Sequence A the aver- 
age sixth-grade group seemed to be most aware of 
the item; for Sequence B the average sixth-grade 
and gifted fourth-grade groups are consistent. The 
high negative correlation for the average fourth- 
grade group, Sequence B, indicates considerable 
shifting of position from task to task. This shift- 
ing is not unusual but it does present problems of 
interpretation. The remaining four correlations 
are so low as 10 imply that little identification took 
place. 

Item 2 is the simplest item of the five and oc- 
curred twice within the song.(next page). Treat- 
ment 115 the compar ison of the individual item 
(Mx) with the first appearanceof the same item in 
the song (My). TreatmentII compares the individ- 
ual item with the second appearance of the item in 
the song (Mz), and Treatment III shows the rela- 
tionship of the two items (My and Mz) as they ap- 
pear within the song. 

All groups showed relatively high means on the 
item, the lowest means areon the second appear- 
ance ofthe item in the song for the average sixth- 
grade group in Sequence B. In general, the indi- 
vidual item shows higher means than the item with- 
in the song. For Treatment I, the only group to 
show consistent recognition is the gifted fourth 
grade, Sequence A. The data showed that the first 
appearance of the item in the song tended to con- 
ceal its true identity. The means for the second 
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Treatment I 
Sequence B 
Sequence A ——À 
Average Gifted Average Gifted 
Mx м мо му Mx М Mx му 
FS 64 91 98 76 44 83 76 
Эн г= .10 T= 299 т= „02 r= 228 
92 
69 85 96 98 15 53 92 
Gir r= .20 г= .29 fe 208 г = -. 14 
Treatment II 
Me м Mx м М Ма Mx ма 
4th 62 33 91 76 76 15 83 67 
r=-.17 те: „38 rz .41 r= „53 
6th 69 58 96 90 75 19 92 67 
Pu „32 r= .08 Ғы „54 r= .02 
Treatment Ш 
SOM эму м му Mz My Mi 
4th 64 33 98 16 44 15 82 67 
rs „19 r= .52 г= .48 r= .93 
6th 85 58 98 90 53 19 92 67 
1:3/120 Ге 08 r= .30 г= .26 


appearance in the song, Treatment П, are uni- 
formly lower for each g roup than the means for 
the first appearance, There is a slightly higher 


or the second appearance 
of the item as evidenced by the five 


grade is more apt to reco 


invent" а sae 
more appropriate, in terms of 


total situation, than the item actually presented to 
him, 


and five are , 30 ог Item 3: (SMF В) 


WS а Consistent per- 


igher “rg” Shifting 


Sequence A 
from group to group for each treatment, 


The lack Average Gifted 
of stability suggests that 5 ub jects are not aware Mx My Mx My 
that the item exists ава note-grouping, but are 4th 28 37 "s "8 
reading it as unrelated individual notes, The two be ad a. 
appearances within the song emphasize the group- 
ing because the item has the same rhythmic struc- 6th 45 БҮ 88 
ture in both instances and 1S a complete measure, r= ‚20 ЖЕН 
the measure bars setting it apart as ап entity. 
The data also show that the «7757 in Treat- Sequence B 
ments I and Ш are higher for Sequence A than for 
Sequence B; while in Treatment II, Sequence В 16 Average Gifted 
the preferred sequence, This Suggests that for Mx My Мі” у 
this item those subjects having prior practice on 4th 40 5 aa 42 
the individual items carried over more recogni- TET NO 
tion thanthe subjects who began with the Song and 
lated this experience to the individual items. 6th 58 23 83 64 
The data also show that for Sequence B the fourth те „39 rs .13 


—< => 
> на 
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Although only the gifted fourth grade in Se- 
quence A shows high mean scores on the item, all 
but one of the *'r's" is positive, with six being 
.39 or higher. The disjunct, but sequential char- 
acteristic of this item appears to simplify the 
recognition process since subjects remember the 
uniqueness of the item even though they may not 
always be able to sing it correctly. The fourth- 
grade subjects show a much higher degree of rec- 
ognition and greater consistency than do the sixth- 
grade subjects, although the latter earned higher 
means throughout. 

For this item the subjects in Sequence B show 
a higher degree of recognition than subjects in Se- 
quence A which suggests thatthe appearance with- 
ina song is more meaningful than an isolated 


presentation. 
Item 4: (FMRD) 


Sequence А 


Average Gifted 
Average NT 
Mx My Mx My 
4th 51 22 91 54 
r=.25 r=.18 
6th 64 44 90 15 
г=.17 г = .65 
Sequence В 
Average Gifted 
Mx My Mx My 
4th 18 10 97 41 
г=.35 шаг: 
6th 5 24 93 52 
ET а 


Item 4 proved to be more difficult in the Bong 
than as an individual item and this is shown by the 
lower My values. The chief reason for this kon 
because it began the interval of a fourth above 
final tone of the previous measure, and subjects 
experienced difficulty in locating the correct tone, 
Although only three of the correlations are · s 
higher, all “r’s” are positive, indicating recog. 
Nition of the item by many subjects. The size 
the “rs” for the various groups прав, 
consistent pattern, nor does one 5 
to be осе to the other al though the gites 
Subjects in Sequence А do better than their p 
11 Sequence B. Е 

The means of the gifted subject 
quences continues to ехсее 
erage subjects, a behavior 
tent throughout the testing (Item 5)- 


of the item appears to be retained since all “178” 
are positive. For Sequence A the fourth-grade 
subjects, with higher ts” than sixth-grade sub- 
jects, apparently are more aware of the individ- 
ual character of the item even though their mean 
scores are lower. In Sequence B, the sixth grade 
has the higher “1757. The data show that sub- 
jects within most groups shift their position from 
task to task, a shifting that has been evident in the 


analysis of each item. 


Item 5: (SLSFM) 


Sequence A 


Average Gifted 
Mx My Mx My 
4th 19 31 84 86 
г = .50 r=, 34 
6th 51 64 95 97 
r=.39 г: „18 
Sequence В 
Average Gifted 
Mx Му Mx Му 
4th 54 14 100 70 
r=.44 r= 0 
6th 73 42 91 81 
r=.49 -.12 


of Тһе Findings—1. Тһе average 
fted sixth-grade subjects in Se- 
quence BS how a higher degree of recognition of 
tonal configurations than do these same groups in 
Sequence A when the correlations bet ween items 
on the song and individual items is considered. 
This suggests that these subjects identify tonal 
configurations presented individually more read- 
ily if they have had previous opportunity to ob- 
serve those configurations in a song. It also im- 
plies that for these same groups, practice on ton- 
al configurations does not lead to greater recog- 
nition of these same configurations in the song. 
The reverse holds for the average sixth-grade 
subjects in the sample. 

2. The subjects within all groups do not per- 
form consistently, аз € V idenced by the variation 
inthe size of the ере” for each item. The fac- 
tors which cause this shift cannot be identified at 
the present time, but it is reasonable to assume, 
in view of the reliability of the test, that the effort 
and attitude of the subject are significant factors. 

3. The data from the present study does not 
offer strong support for the point of view that ton- 
al configurations tend to retain their identity in 
different contexts unless one is willing to accept 


4. Summary 01 
fourth-grade and gt 
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the coefficient of correlati on obtained for the 
combined gifted and average subjects in each зе- 
quence as sufficient evidence of such retention, 
These were all relatively high, .61 to. 76, and 
may represent a more dependable value than the 
**r's"' obtained for each of the sub-groups. 

4. The item, if it isto retain its identity from 
one situation to another, must first be learned. 
The testing carried on in Phase Il did not result 
in such learning for the majority of the Subjects. 
Shorter, more numerous practice Sessions would 
eventually result in learning the items and a sec- 
ond task presented atthattime might have signif- 
icantly raised the *'r's" for the groups. 

5. The variability of the “176” makes itim- 
possible to identify àny one sequence of tasks as 
the most desirable for the development of item 
recognition. The addition of à third task might 
well have provided the kinds of data пеедеа to 


» the gifted Subjects in each 
grade for each sequence had higher scores than 
cts. In 20 of the 28 compar- 

sixth-grade gifted subjects 


fourth-grade gifted. This 
е “size” of the сер» 


recognition by all the various groups. Item 3 and 
the two appearances of Item 2 within th 
pear to retain this identity bett 


5 ose of 
the fourth grade in 8 of 12 comparisons, This is 


a more reliable indication of what is taking place 
and shows that fourth-grade subjects tend to ге- 
tain items that are either very simple or which 
have distinctive characteristics. When the item 
is neither of these, the sixth-grade subjects are 
better able to detect the subtle diffe rences and 
identify the item in a new context, | 
9. It may be concluded that the sixth-grade 
subjects in Phase П of the study not only demon- 
strated a higher level of ac hi evement but also 
showed a greater degree of item recognition for 
the test as a whole, 


5. The Influence of Instrumental Instruction on 


Test Scores—It has been mentioned earlier that 


many music educators feel that experience in pay- 
ing a musical instrument results in a higher leve 
of music reading ability, because instrumental 
performance tends to clarify the abstractions of 
musical notation. A total of 53 subjects in Phase 
П had reported instrumental training of at least 
12 months or more. The ‘‘mean percent correct 
Scores on the tonal configurations were used aS 
the basis of com paring the pupils within groups 
who had training with those having no such train- 
ing. The results are summarized as shown at top 
of next page. 

The “t? tests made show that differences be- 
tween the means were not considered significant 
at the 5 percent level and it may be concluded that 
the instrumental training had little effect upon the 
test scores of the Subjects in the sample. 

There are several factors which need to be con- 
sidered before any Conclusions can be reached: 


1. The groups compared are all relatively 
small. 

2. Piano was the instruemtn mentioned most 
frequently, and this may not influence music 
reading ability, in the sense of this study, 


ûs much as asingle-line instrument like the 
violin. 


For these reasons the writer is not willing to 60 
beyond the limit 


ations of the present study. " 

. 6. Relationship Between Scores on Tonal Con 
табов and the Teacher Ratios оп тола 8- 
Ser Tests— The writer has ee and Kwalwas 
Tyee 


t ї еп. Тћеге агеталу сэрээ 
ое chi is study 4 
fined the m 8 children and this 


ly gifted subject as the chil 
whose musica] achievement iu mue to p 
е majority of his classmates 
evement results from favora 1- 
usical activities, general ШЕ іі 
іс, and performing skill in Bine 
The experienced tea ae 
B Contact with the children over an is 
i ould be able to evaluat? 


қару. 
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Vithout Training 


With Training 
Mean “фр? 


т 


6th Gifted, Sequence В 


гіп ће report showed the 


correlations between the teacher’s ratings and the 
Kwalwasser Test to be relatively low for all 
schools. This is to be expected because the two 
criteria do not measure exactly the same things. 
To ascertain which criterion measure, if any, 
had the closest relationship to the reading of to- 
nal configurations in Phase п, the product- 
moment correlations were computed for the aver- 
age and gifted s ubjects in each school. These 
correlations are represented as AB and AC: 


Data presented earlie 


1. A represents the scores on tonal configura- 


tions. 
2. B represents 
3. C represents 
test. 


the scores on teacher ratings. 
the scores on the Kwalwasser 


These *'r's" are given for each grade in each 


School as follows: 


Av. Or 

School Grade Gifted rAB ГАС 
ју ЈА 4 А . 60 -.06 
2. А 4 G 21 -.10 
3. A 6 A .08 -.14 
4. А 6 С .48 -.36 
5. B 4 A 519 17 
6. в 4 G -.03 499 
7. B 6 А 2508 -.07 
8. В 6 с ein 11 
9. С 4 А 2425 .42 
10. С 4 G . 03 aT 
11. C 6 A „13 -.69 
12. c 6 е -.10 -.09 
These data show а lack of consistency, not only 
from school to school but for the various groups 

аз well. The data show the following: 
1. Neither of the criterion measures are able 
to show a consistent r el ationship with the 
task used in Phase I, although the teacher 8 
rating correlates higher thani the Kwalwas~ 
ser for all groups 1n School А. en to the 


reference is giV 


2. Е 
or School B the p though the сог relations 


Kwalwasser test al 


Group N Mean 
4th Average, Sequence E 15 39.3 1 13 
4th Average, Sequence B 16 66.8 3 85.3 1.3536 
4th Gifted, Sequence A 4 19.3 12 85.0 17763 
4th Gifted, Sequence B 6 90.3 5 98.0 1.1936 
6th Average, Sequence A 17 54 5 57.6 „3339 
6th Average, Sequence В 12 66 6 68.5 „1952 
6th Gifted, Sequence A 6 84.0 16 92.5 1.1961 
5 91.8 9 90.7 


are so low as to indicate that such a basis is 
not reliable. School C is similar to School 


B. 

3. When the gifted subjects are considered, the 
higher “178” occur for the Kwalwasser in 
four out of six such comparisons. 

4. When the average subjects are considered, 
the higher ‘‘r’s’’ occur for the teacher’s 


ratings. 


s of the data obtained, the writer finds 
ent with the many other investi- 
gators who have attempted to establish some basis 
for identifying musically gifted individuals, that 
this basis is yet to be discovered. It is possible 
that the most adequate basis for making such 
identification is the performance on the tasks in 
Phase П rather than outside criteria. Itis also 
quite possible that the absence of а large number 
of significantly high “‘r’s’’ and the lack of a соп- 
sistent pattern shows that teachers are not ina 
position to adequately evaluate the music reading 
ability of their pupils because there is compara- 
tively little орро rtunit 


y for individual attention 
within the classroom situation. 


On the basi 
himself in agreem. 


V. Summary and Conclusions 


Summary of Major Findings Based 
on the Analysis of Data 
o significant difference between 


1. There 15 n 
terms of the ability to read mu- 


boys and girls in 


sic. 

2. For Phase Ithere were no significant differ- 
ences between the scores earned by fourth-, fifth-, 
and sixth-grade subjects. For Phase П there 
were no significant differences between fourth and 
sixth grades either in terms of general scores or 
in terms of the rate of learning scores for tonal 
configurations. However; the RL scores on the 
song showed à significant difference between these 


two grades. Ен 
3, When “percent correct" scores were used, 


there were S i gnificant differences between the 
three schools; School C, while comparable to 
School A in Phase I, was higher than the other 
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two schools in Phase П. School В лаа Loyer 
on Phase I and was comparable to Sc is pn 
Phase Il. There were no differ ences when 
scores on the song were used as a basis for com- 
P. The analysis of data for Phase I showed: 

( a. Тһе subjects experienced considerable 
difficulty in reading tonal configurations 
which commonly appear in the songs they 
sing. 

b. The common reading errors revealed a 
general tendency to respond to the out- 

ward shape of the configuration, but to 
but to substitute a Similar, more famil- 
iar one being read. 

. Considerable guessing takes place when 
subjects attempted to read, as evidenced 


by the number of different errors which 
were made, 


е. The relationship between + 
to the aural and visual 
anitem was not as 
“тв? of .50 to . 70. 
not as accurate a со 
have been made. 

5. The teacher’s ratings of subjects were com- 
pared with the results of the Kwalwasser test and 
showed very low correlations. 

6. Gifted subjects in Phase II consis 
better than average subjects on each ta: 
knew more about reading music, 
the “регсеп! correct?’ 5 cores, 
material at a rate three to four ti 

average subjects. 

7. The song was a moreco 
individual tonal configurations 
sequence of tasks. 

8. Comparisons between se quence 
various groups, using the scores earne 
configurations as a basis for compariso 
the following: 

а. The ‘‘percentcorrect’’ scores for groups 
in Sequence B, with prior practice on 
the song, were not Significantly higher 
as a result of this practice than the 
scores for groups inSequence A who did 
not have practice. Only the fourth-grade 
average group seemed to benefit from 

b. The RL scores for groups in Sequence B 

were not significantly higher than the 

scores for groups in Sequence A except 
for the gifted fourth-grade Subjects. 


he responses 
presentations of 
high as expected, 
This is, however, 
mparison as might 


tently did 
Sk. They 
as evidenced by 
and learned the 
mes faster than 


mplex task than the 
; regardless of the 


S for the 
d on tonal 
n, Showed 


9. Comparisons between se quences for the 
various groups, using the scores earned on the 
song showed the following: . 

а. The “регсепїсоггес!” scores for groups in 

Sequence A, with prior practice on tonal 
configurations, were significantly higher as 
а result of this practice than the Scores for 
groups in Sequence B who had no practice: 
Only the gifted fourth-grade group seeme 
not to benefit from such practice. Е 
The RL scores for sixth-grade groups in 
Sequence A were significantly higher than 
the scores for the sixth-grade groups in Se- 
quence В. There were, however, по signif- 
icant differences between fourth -grade 
groups in the two sequences, А 
10. Analysis of the degree to whichtonal config- 
urations retain their identity from one situation to 
another showed that: 

a. Average fourth-grade and gifted sixth-grade 

Subjects show greater recognition when the 
Sequence from song to configuration is fol- 
lowed. 

Average sixth-grade subjects show greater 
recognition when Sequence A is followed. 
©. Considerable shifting of the position of the 
Subjects from one task to another takes 
place within all gr Oups. This variability 
makes it difficult to accurately interpret 
the ав». 
airly consistent recog- 
rade groups; Items 1, 
ily recognized by sixth- 


1. The scores of subjects inthis study who r e- 
ед instrumental traini 


Scores of subjects who did not 
have such training 


12. The identification of gifted and average 
Subjects on the basi 


reliability in terms of this Study. 
Im 


plications of the Study 
Ine Study 


l. The time allowed 
Was not suff icient to 


4 E + е 
rate ОВ ингэ notation 181088 


likely to result when greater emphasis is place 


: ta- 
: ing the significance of the по 58 
tion rather than upon resp и Шыр? of must 
cal materials, 


3. The mus 


ци. А 
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more rapid rate than average children, a fact that 
must be considered in planning the music activi- 
ties for the grades. 

4. The data did not clearly establish that one 
sequence of tasks is superior to the other in de- 
veloping music reading competence. It did show, 
however, that although prior practice on the song 
did not appear tohave much effect upon the scores 
obtained for learning individual configurations, 
learning the song was made more effective through 
prior practice on tonal configurations. The fact 
that the song is usually approached as a complete 
entity, as it should be, makes it difficult for chil- 
dren to then recognize the individual elements of 
the song when they are isolated. 

5. If the music reading is approached by means 
of the configuration, greater emphasis needs to be 
placed upon helping children recognize the shape 
and design of such configurations. Subjects in 
this study were unable to identify the configura- 
tions inthe song which they had previously learned 
as individual items until they were asked to make 
such an identification. 

6. The results of the study showed that these 
sixth-grade children didnot perform significantly 
better than the fourth-grade c hildren, although 
they clearly demonstrated a faster learning rate 
for the song. This implies that very little actual 


growth in music reading skill takes place beyond 
e the children are 


the fourth-grade level, becaus 1 
not given the necessary musical tools with which 
to work. The tools in this sense of the work are 
the knowledges and understandings about musical 
notation which are developed through independence 
of action. 
7. Instrumental instruction did not seem to be 
a factor in the music reading abilities of either 
the average or gifted children in this study. The 
training will exert a greater influence if greater 
emphasis is given to the meaning of the musical 
notation rather than to mere mechanical responses 
to visual stimuli. Itis highly probable that шинэ, 
mental training at this level is more concerne 
with developing control over the instrument so 
that he is not able to give his attention to other 


Problems of performance. | 

8. Тһе procedures followed in this study are 
adequate, but im pro vements need to be made in 
methods of scoring and evaluating the data. 

9. The problem of the visual perception Ot iu 
Sic symbols needs to be extended to ранен 
whether tonal configurations learn edin ee 
are readily perceived when presented in 0 


tonalities where the notation is different. 1 
10. The relationship between auditory bcd 
ual perception needs to be more clearly esta 


ed. Music educators recognize the interdepend- 
ence of the two, but the degree to which one de- 
pends upon the other needs to be identified. Many 

reading errors made inthis study were caused by 

the sub ject’s inability to “hear internally" what 

the stimulus should sound like and, consequently, 

were unable to control the voice. 
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ONE OF THE most important and infl uential 
public officials in America is the public school 
superintendent of schools. Within the legal struc- 
ture of school government extant in each state, 


the superintendent of schools is an executive offi- 


cer directly responsible to a local board of con- 


trol for the maintenance, operation, and admin- 
istration of the public schools established within 
the district over which the controlling board has 
legal jurisdiction. 

Early New England public schools were con- 
trolled and m aintained by decisions reached in 
town meetings. Later as the population increased 
and school enrollment grew, appointed and elect- 
ed representative poards of control came into be- 
ing whose function it was to maintain and operate 
the respective local public schools. After nearly 
200 years, the inc reased size of school enroll- 
ments had operated as one factor to precipitate 
increasingly complex and numerous problems re- 
lated to the administration of public schools. It 
became difficult for members of a board of con- 
trol to ad m inister the schools 25 had previously 
been possible under frontier conditions. During 
the 121 years since the first city superintendent 
of schools was appointed at Buffalo, New York, 
the functions and duties of the office of superin- 
tendent of schools have evolved gradually to their 
present day status. Functions of the office be- 
came more numerous and complex as increasing- 
ly better and more highly trained persons were 
available, demanded, and employed for that posi- 
tion. The position has become more exacting and 
Subject to varied and increased numbers of extra- 
and quasi-legal, political, ес onomic, religious, 
and sociological pressures. The effects of these 
pressures and demands on the time of the super. 
intendent of schools were suggested by Nortoni * 
when he stated that «The school Sup erintendency 
is becoming ап untenable position in some large 


ЖАЦ footnotes will be found at end of article. 


cities, and in many middle-sized and smaller 
communities it is not a bed of roses.” The nature 
of the public school superintendent’s functions, 
duties, and responsibilities is constantly evolving. 

The Evolving School Superintendents’ Task and 
Function— Perusal of the literature relating to the 
school superintendents’ task seems to warrant 
certain conclusions pertaining to changes іп em- 
phasis and nature of concepts held regarding the 
office. More recently there seems to be increased 
emphases upon the welfare of the child as the cri- 
terion for justifying and evaluating the effective- 
ness of administration. 
the responsibility and 
the improvement of instruction, including atten- 
tion to the selection of a well qualified staff, have 
been noted. It seems also, that a major evolving 
change of emphases in concept relates to the in- 
creased trend toward democratic policy making 

i aneously with need to 
delegate duties to others. This change involves 
wider utilization of staff, pupils, and school pa- 
trons in the community in formulating policy and 
solving problems. Increased emphasis is also in- 
dicated on the function of public relations. 

In 1923 the Depa rtment of Superintendence of 
the National Education Association queried school 
superintendents on their perception of the impor- 
tance of various functions deemed to be those ofa 
superintendent of schools, anda similar study was 
repeated in 1933 by the American Association of 
School Administrators. 2 Data obtained in each of 
these two years reve al the changing concept of 
school superintendents relative to the importance 
of various functions. During that decade increased 
emphasis Was noted on personnel problems and 
supervision of instruction, while а decrease in 
emphasis on importance of functions relating to 
budget, purchasing problems, census taking, and 
buildings and grounds was evidenced. 
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0, 1586 school superintendents respond- 
ed в ee conducted by the pi и 
ciation of School Administrators concerning the 
actual and ideal percentage of time distribution 
for superintendents among eight major broadclas- 
sifications of functions.3 Data revealed that the 
function of ‘‘instructional leadership was consid- 
ered as the one to which superintendents do and 
Should devote the most time. 

However, school superintendents expressed 
need for more time emphasis on 1) instructional 
leadership, 2) general planning, and 3) public re- 
lations. They indicated too much time was de- 
voted to 1) financial administration, 2) personnel 
administration, and 3) school plant management. 

The earlier studies cited, regarding the rela- 
tive importance of school superintendents’ func- 
tions were based upon what school Superintendents 
responding to a Study t 


hought. The first major 
effort to measure the Superintendent’s thinking 
and practice against 
made by Russell.4 After 5 


cluded on the listing. 

An identical list of functi: 
Superintendents of all (116) 
tral Association schools. 
also asked to indicate their judgment Concerning 
the relative importance of each function; but, in 
addition, they were asked to indicate the extent to 
which time they spent on each function was « 


Ons Was submitted to 
Oklahoma North Cen- 
Sup erintendents were 


about 
right, ” “Чоо little, ” ог “too much. > Of the 116 
superintendents included in the s t udy, 102 or gg 
percent responded. (Each of the Oklah 


oma super- 
intendents was the chief administrator in districts 
maintaining grades 1-12.) 

Responses from the jury of outstandin, 
in school administration served as а body of au- 
thoritative opinion and a criterion against which 
the thinking and perceived practice of Oklahoma 
school superintendents ко eri aee 7 

em and Nature o is Study— 
eril roble ofthis study was to det 
relationship between the importance of adminis- 
trative functions and the time expenditures of 
school superintendents relative to various admin- 
istrative functions. Answers to several 
questions were sought. 


5 Persons 


The gen- 
ermine the 


Major 


1. What is the relative importance of various 
administrative functions held by Illinois 


superintendents? | У 
+ To what extent do the perceptions of ме 
tance regarding administrative functions 
held by Illinois elementary school вае 
tendents agree with those held by experts 
the field of school administration? E 
3. What is the relationship between the per 
ceived time expenditures of Illinois en 
intendents relativeto various administran 
functions and their own concepts of the ге 
ative importance of various functions? чы 
- What is the rel ationship between the pe y 
Ceived time expenditures of Illinois Supe дни 
tendents relative to various adm inistrative 
functions and the importance of various Тиле 
tions according to authoritative opinion? 4 
Does the validity of perceived time етее 
itures of Illinois superintendents vary amo = 
Schools indifferent categories of enrollme 
size? : d 
How does the validity of perceived time "€ 
energy expenditures of Illinois super ща 
dents compare with those of Oklahoma аг 
Perintendents based on authoritative opinion? 


In seeking answers to these questions, the ӨНҮ 
was limited to Illinois Superintendents of separ of 
elementary school districts. Superintendents 
separate elementary school dist. 
SO serving as a su 
school district 


Was selected for the Study. Superintendents e 
Such school distri 


The study was 


that comparative data could be obtained. 
Superintendents responded. 


il 
1 dy so that a respondent wou art 
unlikely to rec a1] his response to the first P 


е time and energy he ехр out 
" “Too Little," ог в were 
t two complete response ue #0 
Cond part of the study. nich the 
mple, the manner in W 


were “Too Much, 


Right.” АН excep 
received to the se 
the size of the sa 
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sample was selected, and the number of responses 
to each part of the study, one is justified in con- 
fidently generalizing from the findings to all Ші- 
nois superintendents of separate elementary 
school districts. 

Responses to each of the two parts of the study 
were collated separately by category of school en- 
rollment size. Frequencies of response on im- 
portance of functions were assigned one of four 
weights, and frequencies of response on amount of 
time and energy were assigned one of three weights. 
Weighted ratings were conver ted into weighted 
scores for interpretative and statistical treatment 
and comparative purposes. Functions were ranked 
indegree of importance according to magnitude of 
the weighted scores thus derived. The same pro- 
cedure was appliedto respons es on perceived 
time and energy expenditures. 

Assumptions Underlying This Study— The pres- 
ent study is based on the assumptions that: 


1. The evolving С oncepts relating to the func- 
tions of the supe rintendent of schools and 
their relative importance have not changed 
substantially since the opinions of experts 
in school administration were obtained in 
1954. 

2. The chief difference between the administra- 
tion of a small school and a large one is not 
in the administrator's function but rather in 
the gener alization of duties in the former 
and in their specialization in the latter. 

3. The functions of the public school superin- 
tendent are substantially th e same regard- 
less of school size, comm unity economic 
background or social structure, and number 
of grades maintained by the district. 

4. The primary responsibility of the school su- 
perintendent is to develop a functional edu- 
cational program through the exercise of 
leadership as a director or boss; а chair- 
man of planning groups, а coordinator of 
Services, and an educational statesman. In 
discharging this responsibility certain func- 
tions are present which must always betaken 
care of, regardless of the nature of the ad- 
ministrative process. 

of Administrative Func- 


din Table I showing the 
e according 


_ Relative Importance 
tions—Data are presente 
Tank order of functions by importanc 
to the jury of experts, Ilinois elemen с 
district superintendents, and Oklahoma superin- 
tendents of schools. By reading the ranks from 
left to rightopposite a given function it is possible 
to determine the relative importance of a function 
held by each of the two groups of school superin- 
tendents com pared to the jury ofexperts. More 
than casual observation is ne eded to determine 
the extent to which functions deemed to be most 
Important by the superintendents are also those 


functions held to be most important by outstanding 
school superintendents, professors of school ad- 
ministration, and authors composing the jury of 
experts. Rank difference coefficients of correla- 
tion showing the several relationships are pre- 
sented in Table II. Schools with enrollments be- 
tween 200-499, 500-999, and 1000 and over were 
classified respectively for this study as small, 
medium, and large. 

Table II is to be interpr eted in the following 
manner: The coefficient of correlation between 
the relative importance associated with the vari- 
ous administrative functions by any group of super- 
intendents listed under Column (1) and either the 
jury of experts (Column 3) or Oklahoma superin- 
tendents (Column 2) may be determined by reading 
the number appearing under either of the latter 
two columns and opposite the group listed under 
Column (1). For example, the coefficient of cor- 
relation between all Ilinois superintendents in- 
cluded in the study and the jury of experts on rel- 
ative importance of various functions is . 85. 

Data presented in Table I reveal that the 102 
Oklahoma school superintendents responding to the 
study agreed more closely with authoritative opin- 
ion than did the 50 Illinois elementary school dis- 
trict superintendents as to the relative importance 
of the various functions. However, а coefficient 
of .90 between the relative importance of functions 
held by Illinois super intendents compared with 
that indicated by Oklahoma superintendents shows 


a high degree of agreementbetween the two groups 
of superintendents. The most significant finding 
is p robably the fact that when relationships are 
analyzed by categories of SC hool size itis found 
that the relative importance associated with vari- 
ous functions by superintendents tends to increas- 
ingly diverge from authoritative opinion as school 
enrollment size decre ases. The same trend is 
true toa lesser extent when relationships between 
what Oklahoma and Illinois school superintendents 
think important are considered. 

In general, it can ђе concluded that school su- 
perintendents tend to substantially agree with the 
experts in school administration about the relative 
importance of various administrative functions. 
Illinois superintendents of smaller separate ele- 
mentary school districts do not agree as closely 
with authoritative opinion about the relative impor- 
tance of various administrative functions as do su- 

arger similar districts. Russell5 


perintendents inl 
also found asimilar tendency for judgments on the 
relative impo rtance of various functions to vary 


among superintendents according to size of the 
community in which a school was located. 
Relative Im por tance of Functions Grouped by 
Area—Although the 38 functions submitted to re- 
spondents Were intentionally ungrouped all func- 


tions canbe classified into one of nine area group- 
ings. Average weighted ratings were found for all 
functions within each area classification or group- 
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TABLE I 


S, ILLINOIS ELEMENTARY 
ACCORDING TO IMPORTANCE BY JURORS, 
хиаг и 312. AND OKLAHOMA SUPERINTENDENTS 


= 
Rank by 
Illinois Rank by 
Rank by Elementary Oklahoma 
Function of Superintendent 


Jury Superintendents Superintendents 
Employing of teaching, Supervisory and 


administrative personnel 


1.5 1.0 1.0 
Guiding board of education in policy making 1.5 3.5 ~ + 
Preparing budget 5.0 3.5 я : 
Formulating school policies 5.0 8.0 5. 
Establishing and maintaining effective staff 
relationships 5.0 2.0 5.5 
Planning long range building program 5.0 18.0 14.0 
Reporting to the public 5.0 16.5 9.5 
Delegating authority 9.5 14.5 9.5 
Improving personal and professional quali- 
fications of self 9.5 5.0 5.5 
Providing leadership for in-service of staff 9.5 14.5 14.0 
Evaluating school progress 9.5 6.0 14.0 
Assigning, transferring, and dismissing 
Personnel 12.5 16.5 5.5 
Striving for democratic approach to problem 
Sol vin, 12.5 19.5 14.0 
Initiating curriculum Study and revision 14.0 9.0 20.0 
Maintaining working relations with P. T, A. 
groups 15.5 22.0 20.0 
Preparing of building plans 15.5 13.0 9.5 
Supervising the improvement of instructions 17.0 7.0 9.5 
Keeping operation of school in Proper legal 
channels 18.5 11.0 14.0 
Making contacts with Civic, fraternal, and 
religious groups 18.5 24.0 22.5 
Interviewing patrons and representatives of 
local organizations | 20.0 27.5 24,5 
Holding and attending workshops, Clinics, 
institutes 21.5 25.0 27.5 
Employing of clerks, custodians, other non. 
professional personnel 21.5 19.5 17.5 
Preparing courses of ы 23.5 12.0 20.0 
eeping proper recor 5 23.5 
е to federal, state, and local author- 10.0 DS 
ities Г я 25.0 23.0 30. 0 
Establishing proper rapport with Public 26.0 21.0 24.5 
Providing health examinations for Pupils and 
oyees 27.0 
ще Ше maintenance of the school plant 28.0 A : mm d 
Administering the transportation program 29.5 32.0 21.5 
Supervising non-teaching employees 29.5 30.0 33.0 
Taking of school census 32.0 37.0 33.0 
Designing of record forms 2 32.0 36.0 37.0 
Selecting library and other Supplies в 32.0 29.0 30. 0 
Purchasing general supplies (non-educational) 34,5 33.5 33.0 
Supervising attendance Е 34. 5 35.0 30.0 
Administering lunch and other auxiliary programs 36.5 33.5 35.5 
Dealing with pupil problems, including discipline 36.5 26.0 35.5 
Teaching classes 38.0 38.0 38.0 


~ 


YOUNG - DILLMAN 


TABLE II 


RANK DIFFERENCE COEFFICIENTS OF CORRELATION BETWEEN IM- 
CTIONS BASED ON AUTHOR- 


PORTANCE OF ADMINISTRATIVE FUN! 
ITATIVE OPINION AND IMPORTANCE INDICATED BY 


SCHOOL SUPE RINTENDENTS 


(2) 5 (8) 


(1) 
Oklahoma ury of 
Group of Superintendents Superintendents Experts 
All Illinois Superintendents .90 . 85 
Oklahoma Superintendents 1.00 .93 
Illinois Superintendents 1 
(Small Schools) .81 3 
Illinois Superintendents 83 
3 86 4 
(Medium Schools) қ 
. 89 


.92 


Illinois Superintendents 
(Large Schools) 
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ing i r to determine which areas of an ad- 
таи totaltask were deemed to be of great- 
est importance. Table Ш contains average weight- 
ed ratings by area of function for the jury of ex- 
perts and both groups of superintendents. When 
the weighted ratings are int erpreted according to 
the established categories itis evident that both 
groups of superintendents and the experts concur 
that functions related to the Board of Education 
are ‘‘Extremely Important." While the ex perts 
consider functions in the area of public relations 
to be **Extremely Important," both groups of su- 
perintendents indicate they consider them as only 
“Significantly Important." Both the experts and 
Illinois superintendents considered functions relat- 
ed to administrationof personnel as "*Significantly 
Important," while Oklahoma superintendents con- 
sidered them "Extremely Important.” Without 
including functions concerning the employment and 


ees for experts 
the area of staff 


nstruction are *'Signif- 
functions of a Superi 
pupils and of a minis 
erately Important, ” bo 
rated them as *'Signifi 
superintendents and 


irectly to 
ly **Mod- 


» lunch and other 
Moderately Impor- 


dents indicated + 
were “Significantly Important, » hey 


рапсу between 


by Illinois ele- 
mentary school district Superintendents andthe ex- 


ations and pupil per- 
sonnel administration. 


Some insight into the nature 
between the jury of experts and 
tendents on the relative importance of various 
areas of ad m inistrative functions can be obtained 
by noting the specific functions where there was а 
difference of five relative ranks or more between 
them. Below are listed the functions thought to be 
of greater importance by the experts than by Ші. 
nois superintendents: 


of the difference 
Illinois Superin- 


Reporting to the public. 

Striving for a democratic approach to problem 
solving. в 9 А 

Maintaining working relations with РТА groups. 


Interviewing patrons and representatives of lo- 
cal organizations. 

Making contacts with civic, fraternal, and re- 
ligious groups. 

Taking school census. 27 í 

Providing leadership for in-service training 0 
staff. 


Planning long range building program. 


All functions pertaining to the area of public rela- 
tions appear to be considered more important by 
the experts. Functions accorded greater re lative 
importance by Шіпоів superintendents than by the 
experts arelistedbelow. The first three functions 


pertain to administration and improvement of the 
instructional program. 


Initiating curriculum study and revision. 

Supervising the improvement of instruction. 

Preparing courses of study. 

Keeping operation of school in proper legal chan- 
nels. 

Keeping proper records. 

Establishing proper rapport with pupils. 


Dealing with pupil problems, including disci- 
pline. 


The ex pert 
e leadership role of the administrat 


administrato 


importance on functions relating to direct contac 
with pupils than did the experts. what 

Extent to which Superintendents Practice that, 
They Know—Once having discovered the fact 


i е 
in general, school su pe r j s tend to agre 
with authorit Perintendent 


d O аи 

ative judgment about the relative 1 na 
portance oftheir various functi it was then t 
task of the ТЕ - unctions, i 


3 еп“ 
perceived time and 


я sh what 
tures in practice coincided with W to 


An analy 


Validity of time and energy expenditures 0 


А 
—————————— НЕ ~. 
————————— M 


Рӯ 
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TABLE Ш 


COMPARISON OF AVERAGE WEIGHTED RATINGS FOR EXPERTS AND SUPER- 
INTENDENTS BY AREA OF FUNCTION 


Average Weighted Ratings* 


Jury of Illinois Oklahoma 
Area of Function Experts Superintendents Superintendents 

Board of Education 3.6 3.6 3.6 
Public Relations 3.5 3.0 3.3 
Personnel 3.4 8.4 3.5 
Buildings 3.3 3.1 3.4 
Instruction 3.2 3.3 3.3 
Finance 2.6 2.9 5,1 
Ministerial 2.4 2.8 3.0 
Pupils 2.0 2.5 2.7 

2.1 


2.3 


s of weighted ratings were estab- 


*For purposes of interpretation, numerical categorie 


lished as follows: 
Little or No Importance 


0.0 - 1.4 

1.5 - 2.4 Moderately Important 
2.5 - 3.4 Significantly Important 
3.5 - 4.0 Extremely Important 
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; in terms of what is impor- 
tant. ‘These data are presented in Table ГУ which 
tant. interpreted as follows: The relationship 
sepes elative importance of various functions 
a. eived amount of time and energy ех- 
uo aol a дени дн» d by all Illinois superin- 
COP Сасан і ЭЭ study is -.48. This 
a bore ee considered by these superin- 
aoa nts as having the greatest importance tend 
геро “Чоо little’’ time andenergy spent on them 
iw эн same superintendents and that functions 
и и importance tend to have ‘‘too much" 
iine and energy expended on them. | 
Coefficients of rank difference correlation pre- 
sented in Table IV indicate a Significant inverse 
relationship exists between ens of func- 
tions according to the supe rintendents and jury 
of experts' views about the relative importance of 
various functions andthe time and energy expend- 
itures relative to them. "These data indicate that, 
as perceived by supe rintendents, too little time 
and energy on the job are devoted 
which both they and the experts te 
of greatest importance. 
energy are expendedon fu 
tance. These facts wer 
Oklahoma superintenden 
. When analyses were 
size, it appears that thi 


nd to agree are 
Too much time and 
nctions of lesser impor- 
e also found to be true of 
ts to a degree. 


greatest importance 
other two enrollmen 


y were 


f men- 
tion: 


1. Supervising the improvement of instruc- 
tion. | 
2. Providing leadershi 
ing of staff. 
3. Evaluating school progress. 
4. Reporting to the public. 
5. Improving personal and 
ifications of self. 
6. Preparing couses of study. В 
7. Initiating curriculum study and revision. 
8. Holding and attending worksho 
institutes. 


р for in-service train- 


professional qual- 


ps, clinics, 


9. Establishing proper rapport with pupils. 
10. Formulating school policies. 


Observation of the listing reveals that such func- 
tions are largely concerned with evaluation, ан 
riculum development and improvement, staff m 
self-improvement, and relationships with the pu 

ic and pupils. | 

Y Table у contains rank difference coefficients 
of correlation bet ween the importance of various 
functions and the extent to which superintendents 
perceive that they devote “оо little’? time and en- 
ergy. Data indicate that functions on which super- 
intendents devote **too little?’ time and energy are 
those that tendedto have been deemed by both ex- 
perts and Illinois superintendents as most impor- 
tant. Again there is evidence that Illinois super- 
intendents of medium size Schools tend more than 
those of large and smallonesto devote “Чоо little 

time and energy to a lesser extent to more impor- 
tant functions. The difference between the coeffi- 
cients of correlation for all Illinois and Oklahoma 
Superintendents was found to be significant at the 
9? percent level of significance. This would indi- 
cate that Illinois superintendents, more than Okla- 
homa superintendents, perceivedthat they devoted 


“Чоо little’? time and energy to functions deemed 
to be of greatest importance. 


Functions Receiving 


‘Тоо Much” Time and En- 
ergy— Functions concerning ар ie 2105 


rning which Illinois super- 
еу devote “Чоо much” time 
k ed in descending order ac- 
ency with which superintend- 
dents had indicated ‘too much." Following аге 
listed the ten most frequently mentioned functions 
With reference to Which 


superintendents perceive 
they devote ‘too much" time and energy: 


intendents indicated th 
and energy were r an 
cording to the frequ 


l. Reporting to federal, state, and local au- 
thorities. 
2. 


Administering lunch 
programs. 


· Supervising the m aintenance of the school 
plant. 


4. Purchasing Бепегаі supplies (non-edu- 
cational). 


and other auxiliary 


5 non-teaching employees. 

Горег records. | 
ith pupil prob] ems, including 

discipline. 

Designing of re 

Preparing of 

Administerin 


COrd forms. 
building plans. 


g the transportation program. 


8. 
9. 
10. 


Rank difference coef 
Table 


intendents, and the functions pt 
ing whic Superintendents perceive they devo 
“Чоо much’? time. gj 


c" Ў ion- 
gnificant inverse relatio 
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ships are found between what both Illinois super- 
intendents and experts consider as the most im- 
portant functions and those оп whichitis perceived 
that “Чоо much’’ time is being spent. It appears 
that as schoolenrollment size increases, the fre- 
quency with which “оо much" time and energy 
are ex pended on functions of lesser importance 
decreases. There appears to be no significant 
difference between the frequency with which Okla- 
homa and Illinois superintendents devote ‘too 
much" time and energy to functions of lesser im- 
portance. 

Superintendents perceive the time and energy 
expended in regardto certain functions are ‘‘about 
right." Functions wit h respect to which the 
largest number of superintendents indicated their 
time and energy expenditure were ‘about right" 
are listed below in descending order of frequency 
with whichthey were mentioned: 


1. Employing of clerks, cu stodians, other 
non- professional personnel. 
2. Assigning, transferring, and dismissing 
personnel. 
3. Keeping operation of school in proper legal 
channels. 
4. Supervising attendance. 
5. Preparing budget. 
6. Maintaining working relations with PTA 
groups. 
7. Interviewing patrons and repr esentatives 
of local organizations. 
8. Providing health examinations for pupils 
9 E employees. 
| Hon "iie general supplies (non- educa- 
~. хамаа of school census. | 
: mploying of teaching, Supe rvisory, and 
T sed ve personnel. | 
3 5 ecting library and other supplies. 
. Delegating authority. 


More thanthree of every five superintendents per- 
ceived their practice as “about right" intime and 
energy expenditures for eachof the functions list- 
ed. While the function of purchasing non-educa- 
tional supplies was high among the functions ОП 
Which administrators indicated they were devoting 

too much" time and energy, it appears ninth in 
Order among the functions in regard to which, in 


Seneral, time and ergy expenditures are indi- 
"eye Ра is the only 


three listi 1 three listings © 
En вас E ix of the 38 functions. 
Thus it appears that there ! 
agreement among p 
tendents concerning 
tive practice relative 


the evaluation of administra- 
to the various functions. 


Relationship Between Relative Importance of 
Administrative Functions Based on Authoritative 
Judgment and Perceived Time and Energy Expend- 
itures of Illinois Superintendents— Graphical pic- 
tures of the relationship between authoritative 
opinion on importance of functions and relative ade- 
quacy of time and energy expe nditures for each 
function in two area classifications where the di- 
vergence was greatest are presented for Illinois 
superintendents. A similar picture was portrayed 
by Oklahoma administrators. This will reveal at 
a glance the extent to which practice regarding 
time and energy expenditures coincides with rela- 
tive importance of items. Relative rank order 
percentile positions of relative importance were 
ascertained for each function, and they are indi- 
cated by the top bar opposite a function in each in- 
stance. A similarprocedure relative to amount of 
time and energy expenditures was used as a basis 
for the second bar opposite each function. Graph- 
ical presentations of these relationships for other 
areas of function have by necessity been om it- 
ted due to space limitations. 

Instruction— While, as indicated in an earlier 
portion of this report, experts considered the pro- 
vision of leadership by administrators for the in- 
service training ofthe staff as being of higher rel- 
ative importance than did superintendents, Figure 
1 indicates relatively little time and energy are ex- 
pended on this function compared to its position of 
importance. In general, this relationship is true 
for all functions related to the area of instruction. 
These data indicate that interms of the importance 
of administrative functions related to the instruc- 
tional program, school superintendents might well 


examine their present practices in the light ofthe 


total task with a view of increas ing thetime and 


energy expended on this vital area of functions. | 
land Misc ellaneous— Аз shown in 


Ministeria —А 
Figures 2 апа 3, compared to relative importance 
of specific functions in these two areas, a dispro- 


rtionate amount of time andenergy are expended 
on each function so classified. The most exorbi- 
tant expenditure of time and energy in relation to 
relative importance of functions is in relation to su- 
pervising attendance, designing reco га forms, 
completing reports to federal, state, and local au- 
thorities and administering lunch and other auxili- 
ary programs. Keeping records, taking 5 chool 
census and administering the transportation pro- 
gram all receive a disproportionate amount of time 
and energy to a lesser degree. 

Summary of the Findings— Data obtained in this 
study are summarized below to identify concisely 


the major findings: 


1. City school superintendents in Oklahoma ала Ш- 
inois superintendents of separate elementary 
school districts agree closely with each other 
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i ini thorities in the field 
iththe opinions of aul NM 

m sel administration роп ~ me 
arious administrative " 
петна p тена superi ntendents agreed 
Бол closely with authoritative opinion 2 = 

i ini айу 

ivei tance of various adminis т айу 
ос a superintendent than did Illinois 
pem elementary schooldistrict superin- 

tendents. 


istrict enrollment increases, opin- 

: e e a superintendents in both Illinois 

and Oklahoma were foundto agree more close- 

ly with those of authorities on relative impor- 
tance of administrative functions. 


3. Both groups of Superintendents and the experts 
concur that administrative f unc 
to the board of edu 


4. Superintendents concur in associating a high- 
ive importance to functions 
related to pupils, finance, certain miscellan- 


eous functions, and ministeria] functions than 
do the experts. 


relations functi 
of pupil personnel administration. 
6. The experts asso 

greeo importan 


7. As perceived by both groups of superintend- 
“ents, г elatively too little time and energy are 
devoted to administrative functions which both 
they and the experts agree are of greatest rela- 
tive importance andtoo much time and energy 
expendedon functions of lesser importance 
THE onditions are more true for Illinois 
та (њен of schools of small enroll- 
ме коте for those of medium or large size. 
me 


8. 


10. 


11, 


12, 


13. 
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Superintendents of medium size харшаа 
11015 element ary school districts acer 
they devote relatively more time and шагай 
to functions deemed to be of Ыш, = 
importance than superintendents in dis ane 
of either smallor large size enrollments р 
ceive they devote to such functions. 


: ich 
+ Half of the ten top ranked functions on whic 


А де- 
Illinois superintendents perceive they are 
voting tae “Чоо little" time and pui 
pertain to administration of the а iS 
program. Another such function rela nig 
the administrator’s function of improving 18: 
Own personal and professional quud 
Another pertains to reporting to the Lene он 
two, formulating school Policies and evalu 


я i to 
ing school progress, are functions relating 
the Board of Education. 


Illinois elementary schoo] district ie 
tendents, morethan Oklahoma ВС phi wi 
perceive they devoted “Чоо little time p o5 
ergy to functions deemed by the jury and t ons 
Selves as being of greatest relative <a 
tance, but there appeared to be no such di = 
ence between the two groups of s upe sc 
ents concerning items on which relatively ms 
much” time and energy are expended in ter 

of relative importance of functions. 


cerning which Illinois ree ій 
ive they are devoting El А 
time and energy are pamer 
nd miscellaneous in nature. ine 
functions include those of keeping recor 12 
purchasing non-educational su pplies and = g 
pervising non-teaching employees, designi 


1015 
record forms, completing reports to vari 


= 00 
governmental authorities, supervising sch 
plant mainte 


b- 
nance, dealing with pupil Pion 

lems and administering lunch, transpor 

and other auxiliary programs. 


Comparative 
tions of funct 
and energy e 
tendents геу 


analysis of percentile rank roe 
ion importance and relative с 
xpenditures of Шіпоів же на 
eals that functions related to at- 
struction are the ones where there is the pret 
est consistent expenditure of relatively vh 
little” time апа energy in terms of relat кім, 
importance ассогаеа functions by the exper 
This is gen 


to 
erally true of functions related 
the Board of Education. 


Comparative analysis of percentile rank ae 
tion of function importance and relative oe 
and energy expenditures of Illinois siperi 
tendents reveals the ministerial and misc re 
laneous area of functions to be those song 
there is the &reatest consistent expenditure 


гс а оф шилээ 


—— 6 те тг 
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relatively “Чоо much’’ time and energy in 
terms of relative importance accorded func- 
tions by the experts. 


Conclusions and Implications— Based upon the 


findings of this study, the investigators have 
reached certain conclusions and drawn pertinent 
implications as follows: 


1, 


Perceptions of importance held by Illinois 
elementary school superintendents regarding 
relative importance of administrative func- 
tions substantially agree with those held by 
experts in the field of school administration. 


Superintendents of schools know which of their 
functions are of most importance, but for 
some reason in practice they are either 
thwarted from devoting greatest attention to 
the most important aspects of the taskof 
school superintendent, tending without real- 
izing it, to feel more secure in concerning 
themselves with functions requiring less edu- 
cational statesmanship, or by choice are re- 
luctant to concern themselves with tasks of 
greatest importance whichinvolve higher de- 
grees of risk in successfully accomplishing 
them. Further study is needed to reveal the 
causal factors. 


There is astatistically significant inverse re- 
lationship between functions accorded great- 
est importance by Illinois elementary school 
superintendents and their perceived time and 
energy expenditures relative to those functions. 
This is true also concerning what authorities 
consider as important. 


The validity of time and energy expenditures 
of Illinois separate elementary school dis- 
trict superintendents decreases as school en- 
rollment size diminishes, based on authori- 
tative opinion. 


Increased divergence of agreement on the rel- 
ative importance of aschool superintendent’s 
functions between experts and superintendents 
and validity of time and energy expenditures 
which are associated with size of school may 
be attributed to the more generalized nature 
of administrative function as school size de- 
creases, to the lesser amounts and quality of 
training and experience of persons employed 
in smaller schools, or possibly to the fact 
that relative importance of functions may ас- 
tually vary according to enrollment size of 
the school. 


The validity of perceived time and energy ех- 
penditures among Illinois administrators in- 
cluded in the study is less than for Oklahoma 


10. 


11. 


school superintendents based upon authorita- 
tive opinion, particularly with reference to 
time and energy expenditures related to func- 
tions considered of greatest relative impor- 
tance, namely, the area of instruction. 


Illinois superintendents of separate elemen- 
tary school districts could profitably re- 
examine the manner in which they are dis- 
charging the task of school administration 
particularly with reference to their functions 
relating to the Board of Education and admin- 
istration of the instructional program. 


Not only do the results of this study point to 
the desirability for practicing Illinois super- 
intendents to re-examine their practices in 
regard to administrative functions, but they 
also have implications for the content and em- 
phases found in collegeor university adminis- 
trator preparation programs. 


Based on authoritative judgment, elementary 
school superintendents could profitably recon- 
sider the importance of public relations func- 
tions and their role as administrator in the 
improvement of instruction and the curricu- 
lum. 


The writers conclude that similar findings 
would have been obtained if every separate el- 
ementary school district superintendent in П- 
linois had participated in the study, and they 
suspect the picture would be little different if 
superintendents ofunit or separate secondary 
school districts had been included. Improve- 
ment of educational administrative practice 
is the responsibility of poth the practicing ad- 
ministrator and the researcher who may probe 
more deeply into the obstacles and causal fac- 
tors surrounding the condition where greatest 
attention is devoted to functions of lesser im- 

rtance witha view to re-orienting emphasis 


of effort. 


The findings of this study precipitate questions 
concerning basic characteristics of the struc- 
ture within which the system of public educa- 
tion is provided. Is there а ‘‘best’’ size of 
school organization within which the chief ex- 
ecutive is most able to function as а profes- 
sional leader in accomplishing the task? Is it 
pos sible that the basic organizational struc- 
ture, within which the school exists, can fa- 
cilitate or hinder the most effective perform- 
ance of the chief executive in terms of the 
real purpose for which public schools have 
been established? Theoretic ally, assuming 
that three superintendents with identical train- 
ing, experience, competencies, abilities, and 
motivations secured positions inlarge, тес!- 
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um, or small size school systems, respec- in discharging this task of educational ieader- 
tively, in communities with identical charac- 


ship because of more favorable conditions as- 
teristics, is it possible that with equal effort 


sociated with differences in basic organiza- 
one can be more effective than the other two tional structure? 
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A COMPARATIVE STUDY OF STUDENT- 
TEACHING PRACTICES IN THIRTY- 
EIGHT MIDWEST INSTITUTIONS 


GAIL M. INLOW 
Northwestern University 


Introduction—Background and Sample 


AS ONE MEANS of evaluating its program of 
student teaching, the School of Education of North- 
western University recently became interested in 
analyzing student-teaching progra ms of other 
selected colleges and universities in the midwest 
area. Accordingly, in December of 1957, a ques- 
tionnaire was mailed to the other Big Ten Univer- 
Sities and to 43 additional institutions of higher 
learning in the states of Illinois and Indiana. The 
43 deans, chairmen of departments of education, 
and directors of student teac hing whose names 
appeared in the Thirty-Sixth Yearbook, 1957, of 
= Association for Student Teaching, and the 
ана or directors of student teaching of the Big 

еп Universities comprised the mailing sample. 
анар = 52 institutions which received question- 
cent, lone a were returned by 39, ог 75 per” 
ing a ا‎ д was too incomplete to use, leav- 
follows: 8 = sample of 38. Returns Were а5 
questionnaires, | 9 Big Ten Universities returned 
all 5 of the teac 10 of the 11 other universities, 
vate) 52 тете colleges (3 state and 2 pri- 
respond of the 27 liberal-arts colleges. The 
ее sample consists of 18 universities, 5 
Of the 27 je and 15 liberal-arts ек ее 
tacted, 21 ной с in Illinois which ме: J 
the 16 in Indi completed questionnaires; с 
Point А ana, 10 made returns. From this 
MEHE а ыг Study, no distinction will be made 
«жеш е institutions оп the basis of location by 
xL em for the year 1957-58 in the 38 in- 
оин. were listed as follows: 15 tho usand or 
Set 8 аз анаа 71-18 вана, 2;1-9 thou- 
от thousand, 4; 3-5 thousand, 4 22 
and under 500, ens 3; 500-1 thousand, 14 
МИ и average student teaching enrollments on 
1957.211 projected for the remainder of the year 
and 430 were listed as follows: 250 elementary 
eme Secondary for the Big Ten schools, 86 el- 
Siti шагу and 176 secondary ог the other univer 

ies, 174 elementary and 247 secondary for the 


liberal-arts colleges, and 113 elementary and 195 
secondary for all colleges and universities. The 
total for all institutions was 4078 elementary stu- 
dent teachers and 6421 secondary. 

All but one of the 38 institutions, a liberal-arts 
college, had a program of elementary education; 
all but two, both teachers colleges, had a pro- 
gram of secondary education. 

Ten of the 18 universities, 4 of the 5 teachers 
colleges, and 1 of the 15 liberal-arts colleges had 
laboratory schools at the elementary level; 10 of 
the 18 universities, 2 of the 5 teachers colleges, 
and 4 of the 15 colleges had laboratory schools at 
the secondary level. Totals were 15 for elemen- 
tary and 14 for secondary. 


The Purpose 


The purpose of the study was to discover and 


evaluate existing practices in the program of stu- 
dent teaching of other institutions of higher learn- 
ing to the possible and of making changes in the 
program of the School of Education of Northwest- 
ern University in areas where changes seem ed 
warranted. In prief, the study was intended asa 
stimulus for a more effective program evaluation. 
Information was requested in an effort to determine 
comparative practices in the following facets of 


the student-teaching program. | 
1. Ноу шапу credit hours of student teaching 
are required or permitted by the colleges and uni- 
versities which responded to the questionnaire? 
2, What basic patterns of student teaching are 


followed: hours per day, days per week, and num- 


ber of weeks? 

3. How many с 
experiences in both thei 

ject matter? | 
Е: What kinds of compensation are made to 
public and private-school personnel who cooperate 
in the program? . 

5. How are cooperating 

6. How adequate are coop 
lieved to be? 

т. Ном аге 5 


secondary-school students have 
r major and minor fields of 


teachers selected? 
erating teachers be- 


tudents screened prior to accept- 
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dent teaching? ] 
521 Ее practices are followed pri or to stu- 
dent teaching in getting students acquainted with 
i d youth? 
сЕ цах to the role of the college su- 
Geen and an overall evaluation of the 38 pro- 
p ms of student teaching will be publ ished ata 
Tate date. The length of the questionnaire and 
= Ее. of some of the returns led the investi- 
шиг to report the findings in several rather than 
one manuscript. 


The Findings of the Stud 
= I кие DULY 


Credit hours of student teaching. One of the 
current problems of teacher education is to deter- 
mine how many hours of student teaching should 
be required. To discover existing pr actices a- 
mong the various colleges and univer Sities, the 
investigator included the foll Owing item in the 
questionnaire: 

“Within the framework 
gree program, please lis 
the number of credit hou 
required for graduation, 


ofan undergraduate-de- 
t in the following table 
Ts of student te 
If additional 


сопдагу leve]. 
other universities, 1.90 and 3.10 respective ly; 
chers colleges, 3,40 and 2. 66; and liberal-arts 
Se син, 2. 03 and 1. 90. 
со When mean required and additi 
pined, it may be observed that at the elemen- 
com al, the mean averages are аз follows: Big 
tary p өй quarter hours; other universities, 
Ten, ме chers colleges, 15.50; and liberal-arts 
шөлний Жэ 85. At the secondary level, t 
colleges, - 


Otals 
12.00, 12. 50, 14.49, and 9.97 respectively, 
are . 7 


1 ttern of Student 
daily and weekly ра n 
Hour! 


onal hours are 


1 the 
teaching. Of interest to the investigator was 5 
time pattern which the various institutions 5. 
ed іп their унети шеш Data 
returned as presented in Table II. 

E is to be miea from the last line of tanei 
that the dominant pattern of student ае 
both the elementary and secondary шиг «d 
day experience: 19 of the 38 institutions at han 
ementary and 15 of the 36 at the secondary ita 
follow this pattern. Next in quantitative рт 4 
tance is the full-day program with totals ер deg 
8 respectively. Six at the elementary EEL апе 
both patterns; 6 and 2 respectively are ona : im 
per day basis; 3 Secondary on a 1 hour basis; за 
1 secondary опа 3/4 hour basis. All but n n. 
Stitutions have student teaching five days wee hing 

The range in terms of weeks of student е is 
is varied depending on whether the insti 4. 
on a full-day, half-day, or some other Бі. ви 
tional basis. The range at the elem entary пра 
is 9-27 weeks; at the secondary, 8-32. ee 
for the full-day program is 6 to 14 weeks wi a 
ly one university on a 6 week plan; all others 
оп a 9 to 14 week plan. 

Of interest is 
zational differen 


and the Secondary 


ucation of Мог ће 


Same minor differ 
full-d 


Students are е 


Program for two quarters, 
Student tea 


ields. 
Ching in the major and minor oF ye 
candan, controversial areas in the Held of 5©- 
Condary education is whether students вош. 1а 
Sraduated with teachi these 
i ; andifin several, whether dent- 
Competencies 5 ted ina ын већ 
The аиев{ош аге item as in- 
: Problem elicited responses 
dicated in Тађје ш. ten” 
An analysis of Table Ill indicates that d are 
dency of most student teachers, 68.21%, W ond- 
enrolled in the 36 institutions which have а fields. 
ary programs is to teach only in their major" 
The liberal-arts Colleges lean most heavily іп 10 
direction of Specialization whereas the d en 
Schools are the least prone to narrow f concen” 
teaching horizons to only a major area of Secon 1 
Same logic, 45.86% of the тасса! 
ary students in Big Ten schools have a pra elds 
experience іп bo 
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TABLE II 


TIME PATTERN FOR STUDENT TEACHING 


Hours per Da: Days per Week No. of Weeks 
——Hours рег Day | Lays per Week | iem Second. 
пача Elem. Second. Elem. Second. Elem. Second 
Big Ten Universities 3f, d. 364 All 5 d, All 5 а. 10-20 8-2 
4h. d. 2h. 4. 
1 № Ans. 1-2 hr. 
1-1 hr. 
1 No Ans 
Other Universities 5 f. d. 31. d. All 5 d. All 5 d. 9-24 9-8 
5 h. d. Sh. d. 
2b 1-2 hr 
1-3/4 br 
All Universities 6 f. d. 6f. d AIL5 d, All5 d. 9-25 8-32 
9 h. d. Th. d. 
2 b. 2-2 hr. 
1 № Апз 1-1 hr. 
1-3/4 hr, 
1 Мо Ans, 
Teachers Colleges Lf. d. 1f. а. 1-41/2d. 1-24 12-27 8-19 
1h. 4. 2 h. d. 1-3 d. 2-5 d. 
3 b. ?NoPgm.  3.5q 
Liberal-Arts Colleges 2 f. d. 15 8. АЦ 5а, т -18 
9h. а. 6h. 4. АШЫ re > 
1b. 4-2 hr. 
2-2 hr. 2-1 hr, 
1 Мо Ans 2 Мо Ans 
All Colleges зг. d. 2f. а, 18-5 4. 17-5 а 9-27 8-19 
pa a iy 5 M4124 1-24. 
А -2 hr, 1-3 а, 
2-2 hr. 2-1 hr. 2 Ко Pem 
1 №. Ans. 2 Ко Апз. 
— 9f. d. 81. d. 36-5 а, 35-5 - 8-32 
All 19h. d 15h. а. liia Loa ма 
6 b. 6-2 hr, 1-3 а, 2 Кот 
2-2 hr. 3-1 hr. No Pgm 
2NoAns. 1-3/4 hr. 
5 No Ans 


Legend: f. d. = full day; h. d. = half day; b. = 


-= No Program. 
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At Northwestern University, student 2. 
who are under the Au ene dear не 23 = 4 
i eriences in | 
Заа | with the exception of e t М 5 мэ due 
onomics an - 
on ER E Е а of the Staff of the 
Ра of Education favor breadth of teaching pre- 
paration; of the staff of a liberal-arts coll 
iali reparation. 
и ers 5 the cooperating schools. How 
many colleges and universities pay teachers or 
school systems for the services which they ren- 
der the student teaching progr am? How many, 
and in what way, do institutions Compensate other 
than by monetary payments to teachers or school 
systems? Answers were Sought to these and oth- 
er related questions about compensation for stu- 
dent teaching services Which are perfor med by 
those public and private schools which cooperate 
in the teacher-education Programs of the 38 re- 


spondent institutions. Pertinent findings are con- 
tained in Table IV, 


ере, 


ап institutions. 
Universities 
dthecolleges, 


The median was $9.08. The 


Paid an average of $6.81 an 
$5.31. 


The mean insti 
dent teaching enrollm 


per year, 
Two Schools, both teach 


payment to the schoo] System, no 
themselves. The 


direct payment to 


О not pay either teachersor 
the school board, 


AS may be noted in Table IV, 6 ins 


some kind to 


; 20 extended li 
vileges; 10 permitted cooperating teac 


tend special events at the college or universit я 
and 10 listed the names of teachers in the College 
catalog. One institution which makes no direct 

ayments to teachers or a schoo] board stated that 
it had $90, 000 to $120,000 per year for tuition 
5 fits to teachers in the Cooperating school sys- 
E for in-service purposes, workshops and 
Lage еленді? meetings, | 
рго her miscellaneous methods of compensation 
шэн ted as follows, with parenthetical figures 
were 11 the number of institutions which used 
panee medium: pay principal (1), give din- 
the Р) give teas (2), make union facil ities д. 
ners d 


brary pri- 
hers to at- 


j to 
vailable (1), provide equipment (2), s pie ا ا‎ 
professional journals for teachers and € "ÉL 
finance workshops (2), invite teachers rs 10.00 
meetings (1), and charge а tuition fee о 
per student which goes to the teacher (1). түгээ 
Northwestern University makes по dire 0 
ment to teachers but extends limited س‎ e- 
fits, gives faculty rates to teachers for weit ados 
vents, extends library privileges, and inc 
cooperating personnel on its mailing list. 42 A 
How are cooperating teachers selecte 4 4 бы 
problem that regularly seems to plague tea e the 
education institutions is howto discover eon 
fact whether a teacher is likely to be an m 
cooperating teacher. To get acquainted will ад 
parative practices, the following item was in 
ed in the questionnaire: t- 
What methods do you use in selecting gp ан 
ing teachers? After each method listed, ym м 
indicate how effective the method is by plac y. E. 
check under one of the fourlis ted En «Нуе; 
for very effective, M.E. for moderately effe very 
1. for moderately ineffective, andV.I. for 
ineffective, | V. 
Responses to the item are presented in Table 
Symbols are employed as indicated above. may 
From the data in Table V, no conclusions 


cting 
be reached about the best way or ways of sele 
supervising teachers. 


` nid 
etent are Supervisi | 
the 38 institutions beli Ё 
meet 
rating teachers are believed to 
high Standard 


ent- 
eclared competency of pae T 
агу and Secondary. scr oo] teachers. The к . 
OWS: in the elementary one = гев” 
were believeg to meet high standar a 
ularly or t5 do an adequate job; in the secon etin£ 
field, 91, Percent were identified as ше ities 
these Same Standards. The Big Ten Univers 
were the most Pessimistic, labeling 12.2 ре ade- 
Of the Secondary School teachers asless than 
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quate--either borderline cases or persons who 
i ed. 
Ш теа tha investigator, the percent- 
ages subscribed to by the Big Ten DE ШЕ 
аге fairly realistic. If he differed a а : da R 
lean somewhat more to the pessimistic han о Ше 
optimistic side. Не believes that sec uring Е 
rvices of enough top-notch teachers is one о: 
the big problems in teacher education today. 
The screening of student-teaching a licants. 
In the area of screening for student teaching, the 
institutions were presented with a list of nine 
methods or techniques and asked to respond witha 
symbol R if they routinely used the method; W if 
they used it when warranted; and N if they used it 
rarely or never. Only the responses of R or W 
have been included in Table УП. 
As an analysis of Table УП 
and varied are the approaches 
institutions to the problem of s 


indicates, many 
of the participating 
creening applicants, 
nly опе method-- 


Once more the fact is significant that the num- 
ber of methods and devic 


er hand, were 
ional and charac- 


as part of their Screening proce- 
dures. 


As indicated in Table УП, 15 institutions 
employed standardized instruments for Screening 
purposes: academic aptitude tests, English Usage 
tests, reading tests, Personality inventories, and 
attitude inventories. The Specific instr um ents 
listed were the ACE, SCAT, Miller Analogy, Ohio 
Psychological, Otis, Kuder personality, М.М. Р. 
1., and the Minnesota Teacher Attitude Scale. 

Northwestern University regularly requires q 
physical exam, Student-teaching interview, pro- 
fessional references, and an acceptable academic 
record. When warranted, it requires а Psychiat- 
ric exam, Speech test, and an occasional stang- 
ardized test in English usage. | 

How students, prior to student teac hing, are 
helped to understand children апа Outh. An item 
waa included in the questionnaire war eft Шеті 
was п r practices which other institutions em- 
discove n pre-Student-teaching experience to 
ployed е6 achieve a better unders tanding of 
help stu nd youth. Specifically it asked: 
children a. llege or university has noorganized 

Мез of ind — Prior to the senior 

га i 

ща what procedures, ifany, are used to help 


students discover the interests, abilities, 20 
tion spans, range of normalcy and other 7 и 
teristics of children and youth; and Ше leade ==. 
traits and interests of the students themselves? 


Three of the 38 institutions seemed to hav «ie 
well organized program of observ ation and - to 
dent-teaching participation which begins prior бү 
the senior year. One of the three, a universi = 
requires for both elementary and sec ondary 5 oh 
dents three semester hours of observation in see 
of the freshman and sophomore years, permits 
five semester hours of student teaching in the ~ 
ior year, and requires Нуе semester hours ak 
dent teaching during the senior year. А teac са 
college requires 4 quarter hours of student Кер 
ing in the elementary program during the невине 
year and requires 8 more in the senior year is = 
4 additional permitted. А second teachers’ col Е, 
requires for elementary students 3 semester hou 6 
of student teaching during the sophomore er 
during the junior year, and permits 6 additional 
during the senior year, 


As a means of 


getting students acquainted with 
children and 


youth, probably second only to -а 
dent teaching itself, is the making of vis pon 
classrooms, Respondent sample m e mbers pon 
cated that visits by studenfs were made in t а 
manner indicated below. The number of воен" 
and universities which utilize each methodis lis 
ed in the parentheses which follow each item. "s 

l. Visit in connection with courses in educ 
tional psychology and child development (10). 1) 

Visit in connection with methods courses( Ё 

3. Visit in connection with courses іп educa 

tional sociology and human development (2). 


E ігесі- 
Visit in connection with a course in dire 
ed observation (7). 


5. Visit with 
Make freq 
The 36 institu 


an entire class (1). 
uent observations (10). ib 
tions made 37 Е ura 

Classroom visits. One of the 10 institutions У red 
admitted to making frequent ob s e r vations эргэ 
that 20 to 25 were made by each student in the nd 
lor year. Another Stated that 10 obs ao e gis У 
ach course in educational PSY 
al psychology. = 
ions to асоба, in order өлі и 
п, Мав youth work in service а^ 
Se some doubt existed in the еве 
of the investigator about whether some ae re- 
регі ге required oroptional, ехас 
Plies are provid 

1. Youth act 


ig 
Sequence (for both elementary and secondary) «В 
Ten University) 


ior 


Р ѕег- 
дс", Leadership experience іп com munity 
vice agenci 


псїев, (Big Ten University) у п. 
5. Non-teaching experience with 811479 
(Other than 


Big Ten University) 


а Bv 
Instituted this year--preteaching club a¢ 


~ -- о и ---+-+--4КЕЕ=Е=-=-ЫСЁЫ-----+-=--+-+:ъ----Ҥ{{[.+ч" 
ИИ r 


TABLE УП 


METHODS AND TECHNIQUES OF SCREENING APPLICANTS 


All 
Inst. 


All 
Colleges 


Lib -Arts 
Colleges 


Teachers 
Colleges 


All Uni. 


Other Uni. 


Big Ten 


Screening Devices 


mE 


Qn 


mE 


соо 


Physical Exam. 


0-W 


2-R 


mE 


яч 


Psychiatric Exam. 
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mE mE 
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+ со C 
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mE ав 
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Student-Teaching Interview 


Speech Test 


38-R 


15-R 20-R 


18-R 5-R 


10-R 


Academic Record 


ет 


~AN 


mE 


ос 


Written Faculty References 


4-W 


8-R 


mE 


хоо 


ав 


о ч 


Written Character References 


ат 


= сч 


Autobiography 


ат 


pee 


Standardized Tests and 


Inventories 


method used only when warranted. 


method used routinely; W 
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ities at freshman and ap toners and junior levels. 
Big Ten Universi 

[pes ies d experience--two semesters of 

work in an approved social agency (junior year), 

(Other than Big Ten University) 

6. Students are encouraged to serve as rec- 

reational leaders, en , for groups of children. 
College 
Lap bem 2 provided toward getting as 
broad experience as possible with children and 
youth in church, and all types of youth organiza- 
tions--also summer camps. (Liberal-Arts Col- 
n Recommendation for participation with 
youth groups such as scouts, Sunday Schoolcamps. 
(Liberal-Arts College) 

9. Directed leadership experience with Youth 
Day camps, summer camps. (Liberal A rts Col- 
lege 

27 Sunday school work, camp couns eling, 
playground work. (Liberal-Arts College) 

11. Playground work in Summer camp Counsel- 
ing...teaching Sunday school classes. (Liber al 
Arts College) 

12. Experiences with children inand o ut of 
School &roups--Boy and Girl Зсоц ts, Sunda y 
School classes, е{с.... Summer camp experien- 
ces. (Liberal-Arts College) 

Of the 36 institutio 


а cryptic reply "courses"; another, “попе other 
than class work, ›› 


1. September experience (4) 
2. Future Teachers of America meetings (2) 
3. Teacher-aide activity (1) 
4. Conferences (1) 
5. Resource persons fr 
6. Classes taught by la 
theory instructors (1) 


7. Orientation tests and iny 


om local со, 


mmunity(1) 
boratory 


School and 


entories (1) 4. . 
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all students here must take two prep AME 
courses) prior to student teac ing. 7 
а 5 Да the Child (psychologically a ~ 
iented) and School and Society (orientati сви 
school and community and to soc iology of е тай 
tion). Individual professors prob ab ly dom ан 
than some others for students inthis resp мене 
Those that believe whole-heartedly in рее 
tory experiences for students do a great , 
others do very little. 22 
Тһе School of Education of Nor thwestern 2 
versity has two approaches to pre pimidentetea ue 
ing activities with children and youth: occasio Ёс 
observations in schools in connection with edu s 
tion courses (and these observations vary in раз 
ber and quality dependent оп the individua ys 
Structor), and a requirement of 100 hours of W = 
with children and youth. These 100 hours ми 
Spent in camp Counseling, community centers, 


CAN e 
Scouting organizations, YMCA's, andsimilar typ 
organizations. 


Summary 


In an effort to secure information which = 
be used in evaluating the program of the schoo 2 
Education of Northwestern University, sri ape 
naire was sent to 52 teacher-training inse 
to get an overview of comparative practices. 
of these were Big Ten Universities; the other П- 
жеге colleges and universities in the states of 
linois and Indi: 


Schools in those States. Replies were receive 
from 39 or 75 


or 2-hour programs g 
day students taught for a per tution 
9eks; all but those of one sd 

е range for the remainder 


from 6 to 14 у 
from 9 to 14, 
-32 weeks, 


~ 1 1 t the 

In the opinion of the investigator, ins Я Utions, ~ Л ME en Eun, ddr pcc jeld 

with only 3 or 4 exceptions, revealed aminterest 420 Concentration 29-13, нд. ци ht in bo ин 

in pre-student-teaching experiences eithe: у des- У major and minor field aer dn schools a 
cribing activities performed or by ехрге Sing re- , 4 Уве way with 45. 86 5. te каје Ме sui teac 

greis. hat e жегі бана обаға оф! ing in both fields, "one OF het tly t? 

mos eply came N irty institut: irec 
Bie Кеша: It is as follows: | Bad Бий (28), tutions made payments dire Sr 
This phase of our teacher preparation needs у 


reatest strengthening. Most of us in Student 

oh a feel that we are weak here, We dothe 
sual sort of thing: observation, some visitation 
Es а rather irregular basis, and the like. However, 


15 
9r to the cooperating schoo 

tems (2), е mean amerage payment to + ng 
was $6.06 per duarter hour of student tea "g~ 
Other means of Compensation were in tuitio? 72. 
efits, faculty 


P р 
Tates to college events, library 
Vileges, and other incidental media. 


Cooperating teachers were shown to be select- 
ed їп а variety of ways, with no one method being 
markedly superior to any other. Probably the 
best method was for college perso nnel to visit 
the teachers and classrooms, observe and get ас- 
quainted, and then place a student during a subse- 
quent term. By and large, cooperating teachers 
were described as almost always mee ting high 
standards or as doing an adequate job. Fewer than 
10 percent were jdentified as borderline cases or 
in need of replacement. 

Student teaching applicants were screene din 
many ways, with half or more of the respondents 
agreeing on the use of academic records, a phys- 
ical examination, a speech test, and an interview 
as screening devices. 

The two most commonly used methods of get- 
ting students acquainted with children and youth 
prior to the time of student teaching were listed 
as classroom visitation to public and laboratory 
schools, and work with youth in service agencies. 
Four institutions employed the September-Exper- 


jence technique. 


Conclusions 


In attempting to relate the findings of the study 
to possible program revision, a number of tenta- 
tive conclusions may be drawn: 

1. Programs were earmarked as much by their 
variety as by their similarity. Ranges of data 
were so tremendous at times that medians were 
more appropriate measures of central tendency 
than means. 

2. Yet within the framework of variety, а sur- 
prising similarity existed among the institutions 
of different kinds. The investigator got the im- 
pression that differences am ong certain liberal- 
arts colleges, for instance, Were much greater 
than the differences among Selected liberal-ar ts 
Colleges, teachers colleges; and universities that 
had a comparable philosophy of teacher education. 

3. With few exceptions, the thirty-eight insti- 
tutions used the state requirement of 5 semester 
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hours of education (6 for elementary inMinnesota) 
as a guide line only. The more importantquestion 
seemed to be how little or how much it should be 
exceeded in the light of sound educational practice. 
Of importance is the fact that few significant dif- 
ferences were revealed between ће time schedules 
which were employed insecondary and elementary 
programs. 

4. The data are conclusive in indic ating that 
payment of teachers is a common practice among 
the 38 respondent institutions. Of equal signifi- 
cance may be the fact that 8 of the respondent in- 
stitutions do not pay and yet have acceptable pro- 
grams of student teaching. 

5. The institutions are basically satisfied with 
the cooperating teachers with whom they work. 
The job of the future may be to secure the services 
of other well qualified teachers so that the current 
glow of satisfaction will not disappear. 

6. Little seems to be done in screening in the 
area of personality. The fact that 29 of the 38 in- 
stitutions used the student-teacher interview aS a 
screening device may be a move inthe right direc- 
tion. But more probably needs to be done in the 
future with clinical instruments and psychiatric 
methods. 

7. The majority of the colleges and universi- 
ties were actively engaged in providing laboratory 
experiences with children and youth as part of the 
pre-student-teaching phase of the program. 


In the last analysis, program revision will de- 
pend more emphatically on the philosophy of the 
institution than on current practices of other insti- 
tutions of higher learning. Yet ап enlightened in- 
stitution is bound to be interested in comparative 
practices as one of many sources ofdata. The job 
of the School of Education seems to be to agree on 
those goals of education which are consonant with 
its basic philosophy, pay heed to what psy chology 
and related disciplines have to offer, observe what 
practices are being successfully followed еїве- 


where, and then proceed with caution in the area 
of program revision. 
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